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ABSTRACT 



Bi« STUDEFTT ifrotLBns Bolvlng ayst^ets pr&erinraiied in LISP^ oc~ 
cepts- a* inpuc a comfortable but restricted subset of Enj^lisli i*hlch 
tan VK|]r««ti a wide variety of algebtii atory problems. STUHENT flndE 
the eclution Co a large class of thee* ptoblemE, STUDEHT can utilise 
a Store of global In format tqn not Specific ta any one pTToMem, and 
mHy mijfce assumpCiona abciyl; the interpretation of aKiblguities itt tb6 
wording of the problem being aalv6d+ If it uceB sucb Infenfmation, 
or makps any acaunipt iona ^ STUDENT tOmmuntcateE this fact to the user. 

The thesis includea a *i.irti!iiaify of other Enfilieh language quea- 
tion-anBwering systeasE. All th**c SyEtemaj and STUX»arti- flto evalu- 
Uted according to fciui' standAril Criteria , 

The linguistic analyils In STUBENT is a first JippToxlmat loti 
to the analytic pert ton of a semantLC theory of diacouiEe out 1 in* J 
in the thesis. STUDENT finda the set of Itsmel »entenc*a which are 
the base of thft Input diBcourBe> and tr^tni; forms this sa.gvetice of 
kernel sentences into a s*C of aliYnilianeous equations wliith form the 
fleartflntlo baj* of the SIUDEHT eystrsm* STUDENT then trios to solve 
this set of equations for Clif: vslLit^s of rtqueet&d unknowiB . If It 
is success ful it gives thv ^nS»fers in English, if not, STUDEMT asks 
th4 user for more informatioili atid indicates Ch« nature of Che de- 
slreiJ information. The STUDBST system is a flrit Step tovard natu- 
ral Ungu.-tgc oonEunic at ion with computers. Further work on tlie ec- 
wantlt theory propoa&d abOuW T9«ult in much tuorc Eophisttcated 
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CSAFT^ii II UnWlC^CllON 



Ijhe aim o£ Che research repidifCed Itert tra* to discover how 

OiPie could fruild a tomfut f:r pru^rarn ^Ith could ccumnunicaca with 
p«Opl« in « nsCur^l Urtgu«g« Vlthin t^n* reitticted pToblem domain. 
lb the course «f this invent ie^itictij t i^n^tc « »et of conpiiter pro- 
grams i the STUDEFT fyitititj which 4CC«pC& ^is input a comfortable but 
irectTiCt-BiJ subset of Englieti which cJn be used to expresB a wide 
variety of algebra iCory pnablamSi the problems ahown in Figure 1 
lllM«trat€ some of Cbt CidniTiuniCJClc-n and problem solving capabil- 
ities at thia aystem, 

I& Che following discussion, I ahtll use phrases auch aa 

"tbe comp^tCe!;: undereCdrtde English"* In iU &uch casEB, the *'En- 
gli9.hi*' is Just Ch« rtltficted subset of Engli&b which la allowable 
as input for the tonrputcr progrdm under discuasion. In additlon> 
fi^r purposes of Chli r«i»ort I hirve adopted the followitig operational 
definition of understanding. A computer iicaderfltaTtds a subset of Ea- 
gLlsh if It accepts input aectencee vhicli are uembci'S of ChlB subset, 
amd dnswcTS questions baaed on inforaatian cr^^ntained in Che input. 
The STUI>K!^ Bystem understand e English in thi? sens«i. 



A. The Problem ComtiegBt of the SltJPtarr ijijj^tcTnt 

In COnsCtucClng a quest ion^ansverlng system, many prohleitiR 
are firestly flliir|;jlifled if the problem context ia reaCrietedt The 
aimpLiflcation result irtg frocn th« reitrictlona embodied In Che STU- 
DENT &yE.te[B, and the rea»Qvis theSfl sinplif icat iona arLae,. will be 
diacu&sed In detail In tlie body o£ this report. 

The STTTDEHT eystenL is designed to answer Questions embedded 
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Is Englisli laci^uAg.? St^temsntB of algebra ^tpfy fir^blcras such as 
those Bhown in E'£guV« 1 ■ STTTDEHT does this by COfiS t tutC Ing frcoi 
Che Etigliflh input a tiOTTflspoTiding aet of aLgebeale equilCionB, and 
Bolving this aet of «qu4ticiiis £qt the requaated utiknowriB . If 
needed, STU&EKT ha* access to a a Core of '^global" infoTmation, 
net apeciflc to any porticulaT pToblau, a fid can retri-eve relevant 
factfl Bfud &^u,aeloEid from this atoTe of iflif(>ftn.^t ion- STVDEMT cwrfw 
laent* Oil ite pr[>greaa tn solving a prflKlera^ and cam request Ch* 
1l*lg> cf the quaatianer if it g,4t< £tuck» 

There are a nuiiiiber ud raasoiu tAj I choae the eontaitt of 
algebra story prciblerae in which to develop techniques which would 
alLtfw ^ cofmputar probleifi aftlving system to accept natural language 
Itrp'ijt:* First, we know a gocnJ type of data atTucCurft in which to 
Itor^ informartion needed Co flnSwer questions in this t^pnteKt t 
aanaly, algebraic equatioAj, There exist will kfljOWn algotithma 
£of deducing inf ormat lusn traplieit In the ecjuattonSi that ia ;^ 
V4lii«« £oi particular variables wtich aatisfy the Set of squat lona. 

In jtdd it ioTt , I felt that there waa a manageable subaet of 
English in which many typ^^ of algebra atory problems were ex* 
press lb le* A large number of these story pro^l«nia are available 
In flrat year high achool taxt boofca ^ and I have tranacribed SOine 
of than into STUfGNT** input Engliah. Since thta q ue at iott- answer- 
ing task ia one perfutmerf by hunuana, and einCe the enClre proteaa 
from input to solg:tien of the equationa wae programned , W4 can ob- 
tain a meaaure of tOrrpSriaon between th* performance of STUDENT 
snd of a human on tht sacne problems. Jn fact, chia program on an 
ZBl 7094 answera n»4E questions that it car handle aa fast or 
ffiater than humaiw trying the aame problem* In judging this com- 
pariaon, one ahould rrmsmbar the baa a apted tif the IPM 70? A, which 
oan parforv over one hundred thousand additions per second. 



B* RcagQoa ^QT Want lictE. Natural Language Input. 

Why «h<Hild {tan vent to talk to a coiqputar in English^ There 
«re many tottgucs Che coaputer aLready luudcTBtanda -^ such as FORTIIAN, 
OCMIT, LISF, ALCOL, COBOL, CO rtatne jMSt a f«w- theae serve ade- 
quate Ij a« coBiBunicfltujn cnedia vlth the cowputer far a Larg.e clasG 
of probLemSi A tdotcs pcrCintnt qutsttoil i« ceslly* when ts Efli&llflJi 
input to a ccmputeT desirable? 

S&SLlsh input ia dealrable, for eKaniple', if it i9 nec^^sat-y 
to use the computer fen: reCrlev^L oi Informaelon frtm a teKC itl 
BngLish. If a coinputeT could accept English inputs much infoEmatlon 
tUMf recovdled only in English vouLd he ovailjable- tot compuCcir uae 
without need for human Crana LaClonr 

A CojMguttV which ^t^erat^Dod EnglLali UDuld be mjore acceaalble 
to dny epsakcr of Eng;lial^i» vhether ai mat he was trained in any 
"foTEiEn" ■computer tTmeiiC+ For a single ahot at the comipuCer with 
a queation not likely to be repeated, it would not be worthwhile 
to train the ubet in a ipeCiaUzed language. For fact retrieval, ' 
tsCher Chan document retrieve!* Efligllah is a gopd vehicle far 
ataClrtE queries* For # gooti tt«5Cif ifitlw* ot the differencea between 
tact and dpcuinent retrieval^ 9^it CoOpG^r {\Z}* 

programming langu.a^'as ^X% prscieas prienCed. One cannot 

defletlbe a problem, only s tnectiod for Etnditlf a aoluCinn to the prob- 
Inn* A natural language Ib U convenient vehicle for providing a 
doeeription of the probletn itself > Leaving Che choice of procesaing 
to the problem aolver accepting the input- In an extreme caae, one 
woulJ lik* to talk to the computer aboiiC a probleaa^ with appropriate 
dijeaCio-na and interjei^t ic>Os hy the computer cm aasumptLona it finds 
neceaaary^ until tbe computer claims that the problem i« xu^h veil 
fonnedi and an attempt at soluticn c^n h^ mad^* 
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FinalLyj inanVB abilttjf to use synibolE and Language: Is 3 prtiiie 
fuctcjT in hla Intelligence] and tf we can learn how to ' sake a ccm- 
puter understand a naCaral UfiguJ^e, we will have tak«n * big step 
toward creatins an ^"aif tifitis 1 Ly IntelLigenC^' contputet (32). 



C- Criteifj^a far Evaluating Qu.gation-Anawgtrtnft Systgrna, 

Me h*ve defined undeTstanditiig Irt titra^ of an ability to *n- 
ewtr qu*»tioii* in EngLiah. A ntumbtr *f quest ton- anEwerlng SySteiQB 
have |3ft«n built, and will be deaetl^&^J In the nsKt sett ton + In thlE 
aeeCiOn* we Khali give a number of Cfiteria for evaluating quest ion- 
aflSwe i ing s ya t eras . 

In many ayst^m? there is A separation 9f data input and eiu**- 
tion input. For a.U •yiteme under cons ld« re t Ion , the itiput quest iona 
are in En£li«h» The Input data may fee either in IngLl^h or in a 
pTe9truCtuT«4 format, e.g. a free &r hierarchy. The En^liah data 
inpiit may be used aa a data b^at S9 1b, or mapped ititp a structured 
infonsiat ten Store i Slmiionsj In his twnpetent survey E>f English ijues- 
t ten- answering systeraa <40l>j call* those ays tenia using a atrLieturCH^ 
tnforma t ion store ^'daCa base quest ion-atieuerers", aa opposed to 
"teJCt-baeed question-answerers" which retrieve f*Ctc from Che origins! 
texiti 

The eKtent of understanding of a ^ugation-anau^rlfiij iyetem 

can be measured along three different dlmenSione , syntacttt, seman- 
tic and deductive. Along the syntacCit dimension one CSn measure 
the ETSirmatleaL Complexity allowable in input sentett'CeS * This may 
differ for the data input and quaatiiSn input. In th* s imp 1 eat caacj 
one or sose iWalL number of fixed format senCencea are Allowable In- 
puta. Lesfl; rei!ieritte4 inputs may alluw any ssntencea ■tjhtch can be 
parsed by a fiKe'«!l graimiflr. The nearer thi* graaniar Ig to a grammar 
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of all of Eivg:li«h, th« l4«4 fe^CrlcC&d j.e the tnputi B^vaut^E CsxC- 
biiBsd queEtion-EinBvcrcre ^(^t^t its Inpcit ^Ay ittin^ of motdn, wlth- 
ant further processing, Chey hsve no syntactic 1 imita C ion, on input, 
HawevfiT, the fact-retrieval pri^grsni may only ba able to abstract 

inf^TiaaClon from those partitvns of a text with Leas thdn afme maxi- 
ERUn sytltSCtlc: -complexity. 

Itl data base queatian- answering systema, only csrtain rela- 
tionships 'beCveen word a, or abjecta^ niBy be rapresentablB In the 
inf omUlt loit tt^tt* 0th tr infaimation Tuay be diacardad or ignored. 
ItilB ia a limltntlcn itt Ch& sananCic dlmeixsian of understanding. 

In ordar to obtain «ns^er» to i^w^sti^ns not estplicitLy given 

in the input, a quest! en- ^nSw^rlng system DhiSC have Che j^O^er to p«ir- 
foru aoffie deductions, The structure of th« Infornstion 9C»re ma^ 
facilitate such deductive abiltty+ The ran&e of deduct l-Ve ability 
ia oeasuTed along the de<ductive dimension of understandingf Tti« 
structure of the inforniition Store may al*o aid in selecting only 
relevant maCErial for ui* lit the dedwCtlv* queatioti-flnawering pro- 
ceEB, thus iBipToving the efficiency o£ th* ^yatetti. 

Another criteria closely relAted lo the extent of uOileif- 
»(;:$ndittg] Is the facility Krith which th« «ynCsttlC:» semantic ^ or 
4B<luttlve sbHitiea ot a quest ion- answaring SyttBui C4n be extended <^ 
In ch^ best case one could Improve the ays C em $long &ny d liniettS Ion 
by t^lkleij^ t<* it in EngLlah. Alterniltiv^lyj «ne nigbt bave to add 
6<m\6 nev pragrama to the syEtenn, or 4t wpfSti. Biiiy cban^e ttiight Intply 
cofiiplete repri>gi'aiiiivilng of the entire SySteHi + 

An Importamt ddditiOTial consideration Cor users of a quea- 

t ion- answering system is the amount of knowledge of the Internal 
atructure of the system that is neceaaary to use It. ^t best one 
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need not be aware of Clie Ittf &rfnat toil itorage sCructure ys«id iit all. 
At worst, a thorouEh ktitswl&dge of the internBl b C rutturft mSy be nec- 
estjary to conatTucC suitable Input ■■ 

Another measusr^ of th* uie fulness of a qut^tlotl-^nEtferlng 
system is its- ability Co ttttttact with the uaer. Iti the worst case, 
3 question la a^ked atwi •■OmetimB later an answer or report of fail- 
ure ia stvenp When cShe quest imi cannot b* ani^TBtsd^ no indicfltlon Is 
given of t'he cause of fjiilure,. nor does tlie ^j^item allov the pef^cm 
to give any help. This 1b typical of the i3per*tion of a TiLinrbef of 
Air Force qyttry syststas (Jay Kayaerj, personal comnunicaCio-ti) . In 
the b^st tiiSCi the syctem will ask tht user for specific help Pnd 
acce|it Suggest ions of appToprlate c&urste of action. 

In thia aectlon we have &lven four criteria for ev* Itj-B t Ing 

q«eition-?inBwering aystems. they fflay be i uuiiib r tzed as fiallovs: 

1) Extent of under stafuding; (syntactic, semantic an<3 de- 
ductive abilities) 

,2) Facility for eif tending abilities (syntactic > Smaantic, 
deductive) 

3) Need by user for knouledjgiB of internal structure o£ 

E ys t em 

4) Esttent of inter*ttlon with uaer 



H* EnitlUh Laneuage Queat ion- Answer ina SyECecne. 

In thia aectlotij J shflll give a critical sunm^ry of a njtimhec 
of English language queetion-JnSweriDg syeterBE , at S. lining the cri-- 
tett* outlined in the prevtoui ficctton. Thia diJtuiBion will providt 
* Context for the section of tbft CocLcluding chapter which aumniarijta 
Che capabilities of th« STUDENT ayatem. For « dg«CTiption flf the dtf- 
ftrvnt ayntflctie analysis a-chcmes mentioned belowj see Che aixrvey by 
Bobr^?^ (4) t 
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1} FhilUpe^ One of the earlieit <jue a t ian- a nawer tug fiycteiBB 
vas written in 1?60 at HIT by ilnthony Fhlllip« (3S>. It is b daEa 
base ayatem which accepta aenteiucee which ttti bt pfl;r3ed by b very 
3iiiip].e ecintext-free phraae atructutc granHflarj of the type defined by 
Chomsky (8>+ Ad41.tl.onaL ayntactic TQSCrLtCi&ClS req^itre that each 
word must be in only one graonnBtical cldsei, snd tttdt s sentence haa 
es^actLy one psmin^f 

A p«ti^d e^rtt^oce ts traneforHie^ into a Ltat of five ele- 
nentap tlie aubjett n verti> abject, tijne phr*ae> and place phraae in 
the aentencet All f3th«r InfarmaCion in the **ntenc« is die regarded. 
Quest toriB ate an-SwctccJ ^y taaCchicig the Liat ftoni th* tr ana formed 
qtieatton against the lict ffrir each input aentsntci Wl-iefi a match ia 
iOvM, the correspond! ing, «*tltente ia given as an aniwer* 

fhillips' EyeC-BHi has no deductive ability and adding new 
ab tilt tea ^HHild require ffrpt^graiiwiing the syBtem* A quest iaaeif nwiat 
bv iiv^EB fhat the ByECem tit i 1 i£*a & matching process which do*a fiflC 
t<!C0^ni?e BynonyniB, and therefore the sentence "The te4£h«ir «*tffl 
lun«h JBt tWKm/^ will not be t^Ca^niiid as an anawer to th? q^jcdtiotli 
"What duBi the teacher do at twelv* d^ clock?" iTlien PhlLLipe' Eyatem 
cannot find m answer, it report* enly ^'(IHE ORACLE DOES HOT EHDW)"* 
It presides for no further interaction with the uaer* 

2} Cyi^en^ Paaeball t* a qu«SClon- answering system dcsiftni'^ 
and pfograaYBied at Lincoln L!lb<?fatciifleB by Qreen, Wolf, Chomeky and 
Laugh*r¥ <19>- It la a <i-^t^ tha*« aystein, in which the data ia placed 
in Tiiefflory in a preatructur^ij tr«e format. The data Constati of the 
d«te<» location, opposing team* atid scores of sonie American League 
baaeball games « Only qucationa to the syBtenv can be ^Ivett in Engliah, 
not the data* 
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Qusstions mLiBE be sLmpLe senEenc&e, vltti no reLetlv£ 
ciduees, logiral or coordinate coftniCtives + With these reatirlttl&tia * 
the proETam will accept any qM*st£i3fl CdteJiBd in words contalnfld in 
a vc:?C4bul4rj list quite adequate for a^kiti^ '^'ucBtiona about b^3e- 
ball etflt titles. In add it ion ^ the parsing ro-utltie, based on tech- 
nLiJueft deiveioped by Harris <2l>> tncist find a parsing for the ^u«StL[>n + 

The questions blibC pertain to StitiecicB about baseball 
gafnei found In the InfoTTaBtlen stere* One cannot aek qt^e^ttOH* 
abutit KKt remap ouch as "high 6a C" 9 cere or "fewest" i;\M«ber oi ^amee 
»*h» The parsed queatlon 1^ t r*fi9 formed into a aCandsrd 9 p«Ct fixa- 
tion (or spec) Listp std the q ties t ion- Answer ing, rouCitie tit 1 Use* 
thifl cancmicsL form foif Clie uneatBtng of the que at ion » For citample, 
the queation "Who beat the Yankees on July AV* vould be transfonned 
into Che "spec Iist"t 

leam tiogtrtg) " Nev Tork 

Teajii (wimfltfig)- ? 

Date - July 4 

£ecisu96 Baseball do^s not utilize English for date inputj we 

cannot talk ab&iit; deduct lone oiade from infovmfltion impiicit in seif- 
eral 3etltenteS+ However, Baaeball csn pfirform operatione gyeh a 
counting (the number of gameE played by Boston, for eMamplfe) 4nd 
thuE in eht Sense that it is utilizing 9(.iv«Tai separate dat^ units 
in its etorej it is performing dadLictl£^E^9 + 

BaEeball'a afciiities can only be eH,t«nded by eKtenalve r*- 
progranmlng , though the techniques utilised h*ve some getiersl sppLt- 
cability. BecauSft the parsing progTsm has « very complete gr^iimirTrH 
and the vocalmlaxy list is quite corop s: eh CnS Ivc for the problEin domain, 
the user needs no knowledge of the interns 1 structure of the 'Ease- 
balL program. Bo provisidn for Interaction ulth the user was iaade: + 
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3> Strnnonfl, The SYHTHEX system is s teatt- baaed quest ion- an- 
euei^i-ng system deslgQed and pro^trtiiintiil St SSC by gifwn()n3> KI«iTi ^tul 
McCufiologU'e C^l-)' The entire cont^n!:* of s cbJ-J-jiren' * encyclopedia 
has been ti'Anscribed to magnetic tap€ for U9c las the tn forma t ioti 
ltDte+ An Index haa been prepared liBting the lo^^tion of ^ll Cbe 
Content word* in the text, i.e. intiluding word* Like "woif*,''' ^^eaCj" 
an4 "btrdij" while eKcLudiiiB funcittcj-n wor-ds Like "ands," "the," a^d 
"of*" All th* cefitefit words of a que at ion are extrdotad, aiwl In- 
formStlOn rtch sectipsis of the tex.t are retrieved i ii«i A^^Clons 
thfit rire locally dense in ccmtent varda cort dined in tha q^j^eCiOA* 
Far example* if the [|ue3ti0t\ w-nxn ^"Vhat do vorms eatf*, with 
cantect words "worm*" and "cat^% the two aentenceo "Birda eat worms 
on the grasiiii" and '^HJOjit l^rbOB usually eat grass." laight be retfievcdi 
At this time, the program pcrforwe a ayntactic analyaia of the ques- 
tion fl;nd of the sentences tJi^^t iinay contain the ansuer. A comparisoii 
o£ the dependency trees o£ tli'e quoStJL&n and various aenCencea may 
eliminate some irrelevant eatLt-etittS * In the eKample, "^'Birda eat 
worms on the grass" is aliwinat&d bc^oamae "wBriaa" la the object of 
the verb "eats" instead of Cha aubjett ifv in the <jueaClon. In, the 
general case, the remaining ««Ttt&nt09 are 8.iveii in aooie ranked order 
a* ipossibly answering the quaationit 

SYW'rtlBX is limited ByntaEtitsIly by tt:S ^tsuuair to the sk- 
tant that the syntactic ana lye is ellmtnates t if relevant atstements. 
It makes no use of the meaninB cjf any StatCTftetita QT wordaj and cannot 
deduce answers fram infoTHiBtloTi implicit in tWi or more aentences. 
Becduse tlie graiaaBr la indapamjant of the prc^tuni the syntactic 
ability of S'fHTHEX -can he extended relitttvcly easily* However, be- 
fore It *afi >ec<H»e a goncwS queat ion- gnawer irg -SyStenij spffle aemantic 
fli^tlltiea will have to be added* 

- SYimiEiE dofef not ex|>licitly provide for intet action with the 
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usfiT, tmc ^gtsase tt Is iTEplematited tn th* SDC t Ime- aha r ing syeteiB 
(9), 3 user msjf modtfy a previoua qtiestton i£ the a&ntisneefe re- 
trieved werft not auttable. The mech^rt tsm for select it>n of sentences 
mu^t ]k 1{«^t in mind to get b&sC r*tuLts<. 

4) Lifidsav. tfhlLe at the Carnegie Institute o£ Technology n 
KoberC L^ridtdy (28.) prograTimied th^ SAD SAM questlon-anEWETlne system- 
H^t input to the system la a. ^et o£ flenCencea In Eifilc Enfiliehj a 
BubetC of Enjllfh devised by C + K* Ogslfln (35) ^ which has a vocabular;/ 
af db^ut: 1500 words and a elmplc Subset of the full Rngltsh gram- 
mat. The SAB pari: (Syntactic Appraiser and Dlsgrrimmer) of SAU SAM 
pciTses the sentence MSlug -* predictive analysis scheTiie. THe Saman- 
tlt Analysing Hachine (SjSM) eKtracta from th««« parsed aent^nCea 
infomifltlon abmiC thft fiXmlly relatlo-tteVi Ipa of people mertCi&ned; ft 
Etorea this in f o f coa t ion on d computer f tpresentatton of Che family 
tree J and ignores all Other information in the sentence. For example, 
froiB the paraitig vi "Tocn, Mary' a brother t went to the Store." Lind- 
say's progrsiJi would eKtracC Che sibling relationahip of Tcsm and Mary, 
place thew On the family tree as dcstend^nts of the same mother stvi 
father ji and ignore the informacion about where Ttm Wflnt- 

The information stofSge structure UEllla&d by SAD SAM, namely ^ 
the family tree, fflctlltaces deduct Ions ffOm infomatlprt Implicit 
In many sentences - Eeceusij ^ family relationship is defined in 
term* of the relative position (no pun intended) of two- p-e-opLe in 
their family tree, tOniputatlon of the felationshlp la independent 
of the number of sentencei required to plate in the tree the path 
between the individuals. 

EKtetidittft; the abilities of eh« SAB SAM system would requtre 
r«prpgr*mrain3 . f|o proviiion is made for interaction with Che uaer* 
So tntsrnsl fcoowLedge of the program structulf* is necessary if the 
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user restricts his. q;uerteft to questiona ^>S farntly relationship*, atld 
hi* l«itgudg« to Basic English. 

5) Raphael. The SIR quea t laii- *n*wer ln;ft ayaten CiiineinO<Tl t«r 

for Seisantie Inf oriBa t ion Retrieval) was designed by Bertram Baphael 
(3&) aC HIT* The SIH syStean accepta simple SCtitentea in any o£ 
ahout 10 fixed formats useful for eKpresEing tertain relationships 
hetveen obJetCi^. The semftnttc relatlonBhips exCtacted firom tJiEEe 
aetiteneea are Ch»se of set mamljersh Ip , aet incluitoOt subpart, left- 
to* right position and ownership! 

The Information about Che reLatlomships between TSriiJuiS ob- 
jects is atoted in ^ scniafLtic network, where the nodea ot the iflet- 
vatk «ee ebJectB and the reLationshtpS ate Indicated by directed 
labelfld links between n&dea. For eKSmplej if the three sentences 
"John is a boyj" "A boy ts a person," ynd "Two hands are p41fC of 
any pereon" wsre an liiput to SIH, four nodea labeled Jobn , bo^fi 
person and hand would be created. Included In the network Would be 
a He* itid tea tln£ eeC meaiberehip between Jehti end bo-yj finpther with 
£ label indies Cine aet intlusion between boy and person, ^nd a ttflk 
ittdlcaCinE bsfld is a subpiJtt of person, with the number of pfltt3 e^jual 

Separate qveat ion- answering routine* are used for quest teTl* 
involving diffetent relationships- Each TDutitt« Cakea cognlEanee 
of the interaction *f various rwlatlonslilpg , ;iT»d tan deduce answere 
fram thft linked structure of the network j imjependcnt of the number 
of senttrtces which IrtTe necessary to set up these Itriks. Far exEiTii- 
pLe, by tuflclng the llnka frotn 'fJohn*' to '^hond," SIR would anawer 
"YES" te the quESEiotx "la n b^rtd part of John?" 

The SIR aystem can Inteifact with thft user.. For eKAii^le» If 
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teld t!haE "A finger la part of ^ hatw|f' atwi aek&d "How many fingers 
doftf J*hw tlflvel" It woulsi reply 'How many fingers per hand?" Then 
If It ii Co 14 "^very hand bas flv& finger s," it 'would «n»i?er th« 
quest lort with "The anBwer La 10". 

Aa^ AKtcnjlona of the Sift system necej!ifilt.0tfl AildLtLdciAl pro- 
graramlng effort, ChOu^h It ia considerably easier to ddd new Syncse- 
tic foriDE tfajtn tt^V aeAsntic relationships. Within the input limitA 
pf the 20 fixed £o-rtnSt ct«C<9NtentSj the user need mrt kno-v anything 
of th* interiial Btmcturt olE the trtforiBatlo-ti atorege atructura. 



El OClii^r Relatend. Woric, 



In addition to those (fu^Atlo^-itiiUerlng 9y<Cem» described 

ttfOV^, a tiLiinher of proigraiikE h4v« beeTt wf ItC^ii tO ti'di^sldte EAglLsh 
itateatnta into ti log teal nOt^Ciotk tO check the conslstetucy of a set 
of statements, snd the validity of loglCftL Arj^uflMntS. In the senae 
that* given R corpus trana formed tP smiiB IdigiCdl fiOtitlcnj aiCVd a (mother 
etit«inctlt> s lo;glc-'ba£ed Bystein c^n dn£w4i:t tUt qu«3tl«A ^''1« this 
atatem^rtt (cc Its; negation) Imp- lied by the corpus?" > tuih l<fglt- 
baaed aystcnia *r4 ^msttoti-anawerlns systems^ 

Cooper Cl3) and Itarlington (14) both hdve progf'anis i^lilph 
tr#ix&lat« ^ Siibssi 4f English into the propositionEil cei 1 cuius ^ I>aT- 
Itngton is alsc working Cn prc^grams which can tranalate English into 
the first or^Qf Stwi secssid order predicate calculi- A difficult prob- 
lem being tsoneiderBd by Dai^lisigtori, In trying to handle imp 1 teat iona 
of Engliah EtateuientB in ttrmS s?f their logical translaition, is the 
dfCermination of the proper level of ^n^lyfie iw a pacticul^r piiTob- 
len - chat ia> whether to translata ch^ input into second flifd^r 
predicate calculus irtiiCTC pta<i£a are vtry difficult , or to try to 
uae first c?rd(^r predicate <3r propoaitional calculus to prove the 
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tbeprem, aoA perhapft £irii J-t LoaleaLLy inBufficient. 

At the Mattonal lhir€«u oi StatHlflTda, Kirsch (i2) , Cohen (10) 
»tiid £ill«flE OS) have deeigneJ a iy'Stem in which pictures and Enelieh 
lungu^gt StateiEentE are converCC^S Eo estpreea ioiis in the fiTBt order 
preJitflCe eal-cuiuE. OnB can thftn check to S-ee if an En^liBh language 
Btatement i9 cmtieiBt&at viCh <i glve:tl t^Lctuirep 

McCarthy's Advice-Taker <30), though iWt dealened to accept 
Etigllah input, wouW tnake a.n eotcelLent base for 4 quest ion- flpJWsJ*^ ins 
BystesB.^ Fiacher llscl^ (Z-} haa programnied a ayatem which can ds all 
of McCarthy's Advlce-Takcr prohlems, and can he adapted to accept -l 
very limited subset of English « Ihe deductive ays tern in Black'' 9 
prDgrara is equivalent to tne pTfopcEltlonal calculus* 

A nwnher of people hav* done vort bearing direct Ij on the 
problea of sclvinfi al^Ebra wotd profeJems stated in Eneliah, Sylvi^i 
Gar fink la C13) *nfflte a paper in which sh* descTflted the heuriatlca 
she would u*« in prflgiraHiBing a computer to S0lV4 algehra word prob- 
l-SBB, hut nffv** wrote the program^ Most o£ the huuriatica were too 
vagiia to really be USed; e.g. just stating that one should identify 
two vatiahlea' naaicS vhieh are only slightly different^ htjt giving 
no good criteria fer a alight diffitcnee, the tr-efl:tmsnt of "this^' vHs 
taken ttwn Car fink le's paper • SiS were a uuanher of aimptifiod *tate- 
ments of «Ig«bra story problems tht Cfsnecrtbed and transformed from 
problems in a fli¥*t year algehrd text bookt 

Michdel Col^taan (11 J , at MIT, wtrete a teifra paper deacrihlng 
a prograiB o£ hi« which a eta up the cquattort* fo* aoae: typea of alge- 
tr?i Story pTD-hlana <alao handled by STVUEISIT) . SofM of the special 
bey-riatica 1 use fof "age problems" were inspirE^ by techfiitlties he 
invent ed« 
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ItH hifi theals, David Kuck (24) deacribes hli Ideas on hav tc 
constrtJtt thia type of program > but again did ttot ijnplement thea* 1^1 eas. 
He suegesta method a for trpns format ion of ET^gltsh Input to aqq,jations 
Uhlch would require iSLfCh mote inf ormat ion aftont word a than le uied 
in eh« STDDHin: proeranij an4 therefore veif* not applicable: in this work. 
The SrUDEHT program coneldcfB words aa 3ymb<il*j i»nd makea do with 
as Little knowledge about the meaning of vsrd* <is ia ccopatlUc 
with the goal of finding ^ aolution to the pdrticulaT prcblem+ 
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taagtSlt, II; SEMANTIC GgmRliTX™ ASTO ABJALYSIS OF DtSCOUREE 

The putpoi« of this chapter It to put tte t ethn tqwes of an* Ly- 
sis erabedded in the StTJlffiifT program Itltfl a wider canteKt> and indi- 
cate hoH they vonli fit InCo a m^re getierai language prot*asirtS sys- 
tem* We will dee tribe in thio chfiptor a tlieory of semdntt* ^^Tiera- 
tlon and analyaia of <) iicourfle, STUDENT can then be considl'Qtii^ a 
£it«t approKimat ion to a cisinpiJter implenwntation of ths anslytit 
pott ion of the theory* with certain raStrictiOTiH on the Intftl^reta- 
tion Cf fl discourse to be analysed. It will te evident frmii the theo- 
ry vhy analyato is «o £r«atly simplified liy thfe Impoaed reatriettOrtS. 



At Languagg a a C pnini.m i c a t i ChCi ■• 

language it in ftncfrding uaed for CdmfnlJl teat ton betvesn « 
»p«akei^ and a listener {or writer and rcailtc) . To transniit nn 
"iiea"^ the apeaEcer must flret encode it in a BeaEage, ae a etrlnS 
in the traQsmission l^ngu^ge* In order to undefeCaLid Chia meaagsB* 
d lUtenev mist decode tt> and extract its mcanitie. The ctrflns of * 
pdrCicular message, M* ts a function o£ Wth ita global cc^ntes^t *nd 
local cOfiCeKt. Hie global cOfiteKt of a measage ia the background 
knowledge of the apeakst *t\4 the lieCensr, intludins aome fcnowledigi; 
of possible unlveraeB of discynrae, and codings fdf eoine simple iie&S, 

Eie local context of a meaaageK M, is the set qf meeaaga* tem- 
porally adjacent to M. H may refer back to earlier me^sa^es. M nay 
even be jufit a fncdif icat ion of » previous message, itid otily lindar- 
atandable in tbt* contest ^ For *;icaiiLpIe, consider Che stcocid sen- 
tence of the fol Lowing discourse: "How ntany chaplains sre in the 
U.S. Atfayl now many «re to the navy?" 

In ord^r fee cflramunicacion to take p-lace;H the Infermatton map 
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o£ lK>th the llBteDfiT and the speaker must b« S|3praximatii:l}r the aame, 
4C ItSSC fc^t th6 un£vsT«e of d Is course ;i ^Ibo the d^c-^ding ftoCess 
4f Chs J-tsttiUV nuiAt b« in approve Imate inverse of th& &nt(^dlng ptoC^es 
ot fhi ip64ket« Education in lan^uaje ia, In ldTg.e p^rCji 4n attempt 
C^ f-Di^^e the Language praceasars of different people ixit€i t anLtozm. 
H^ld t<» E^ClLltate succesBful communlcatlan. U^ are ttot proposing 
Chat identity in detail Lb Bchleved^ but as (^ulng 44' itlC-C:!^ put it 



"Different per Bona growing up til th« »«ne language are 
like different bushee tyirtmftd ind trained Co take the ahape 
of identical eiepijancs. The- /mjitouilcal deCaila a£ twiga and 
bratichea will fulfill the c-lephflrt tue fonn differently frortn 
buah te bu&h> but th« Overall outward result b are alike«" 



Aa a ipeaVer ttfnsniita Euccessive measages conce^rnltig So^nC 
povtiotl of hll inform^ticin map, the lietener who 4n4 e r S C an/d e the mes- 
sa^v-A ccmStructs a model of a "BiELiaCi3n".i Tti^ t:«l4tion between the 
listener ^e mm^el tmd the speaker^ a information nap it thDC ftom eeeh 
e*n be extracted the transoiltCed InfoiKnatttsn te Levant to the univerae 
of diBcouTae, including lnfOFn»aClcti deducibLe froon the entiT«± a«t 
o£ ineeEaEeB. The internal struCCure of the Listener's model tife&d 
bear no reaemblance to that cf th« Jpeiker, end may in general tan- 
tain far leaa detail. 



B. 'nieoriea of tanguase^ 

According t& Morris' tKeoty of signs <33>> tlie encoding and 
decoding af langiLage can be etratified into three Lev«ls. The first 

level is the eyfttactic whlth deals with the relationshtp* of figna 
to other aigne. A syntactic analysis, treating words as iri<iirLb<jrs of 
classes of words j tSfi yield structurings of messages which indicate 
common proceesiag features* The second level, semsntie anelyiis, is 
tonc^smetf with the relationships oi *i^ne to the things they deflate* 
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A ttitrrf I'evels prafiiii^Llc analysis, is totit^i^rt^d with the relationships 
■between Eigne an<J tll'ftii: iLiCerpreCatitms in tctmS flf actlcmB re^giiired* 
0«T tbeory vill deal with 9 11 three Levela of «ii4ly«is, with a pri- 
mary emphasis on the rtlatiom of the semantic dJpecC of language to 
the Beneration of iJiscou-tae. 

Hany theories of &/iit4K ttflve been dev^alope^ to describe the 
•traeture ot English, imii m^iiy «if these have aerved H bacts fo* 
ccTiiputeP prugrama which perform syntactic analysis* Fs5t a coaplete 
mrvty ot auch ayatema see the paper by Bobrtre (4). Almost all 
of the»e theortea tgojore the concept e of meaning and sem^intica. ■&£- 
C^u^e they ignore finch aa impo r t ant aspect of ltinguage» pifOBraoiB baaed 
Ota such theories often yield aatiy posaible atructurln^J foe a single 
ientence which La imambiguouS Co a per a on. Kith somiB uba of Beatiingi 
Tuariy of the meaningly b 6 amblgviotie interpretations could fee eliaitia t ad . 
For * goad dlacuBsion of vhy flmbiguitiea arise in ayncattSt atialyais 
SEE Kuno acxd OeCtinger {'lb'). 

Bu«d Oti acme ideas deacrlbsd by Yflgve (46) , a nuinber of 
programi hsve been written which generate ayntacticEilly correct En- 
glish strntedcea. In moat crises* thu sentenceE generyt&d sre pre- 
damina t el y mt^ Ci i rtg 1 ea a nonsense* The coherent diacoiirse generator of 
Elein {23) is ttie *tie exception I ktiOW. Klein utilizes «n tnpwt test 
from which he citractfl certain structural dependencies oi th* wcsrd? 
in the inputs He then generates sentenCifia sa^ beture they arc r*- 
Leas-ed for output , n poatproceasor check* to see if the words in the 
generated aeutenjce aattafy structural depcndeticiea conaiatent with 
those fciund in the input tejtt- However, svwa in Klein' a program no 
attempt is niade to u«a the defictlve meaning of any nordj except in 
so far as thi* laeanififi la teflflCEe4 In it a cooccurrence* with other 
words in Che itiput text* 
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Soma thecjifics wtilch do can^idet the problem of semantica ate 
balng developed now, PendsEraft (17) «t*tes that th« proerama being 
J eve loped at the LitiJuiBtic ReaearcH Center of the Unlveraity of Tfti- 
«« are an ejtpllcatton of MoTTisf CTia^ry of aign.t. Though not yftt 
implemented, the scmimtLC analysis pre gram wlLl make use of a pre- 
liminary phrasfr structure ayntactie dnalyaia. A cmmber of synttCCic 
structures, with Sppropriate voca^ljary iteaaj will map onto atrial* 
eemantlc constant*, eEaentiaLLy tndicatinfi that thcie atructure^ all 
have the same ?tieaning,. Thia gives a type of caticinical form for 
stnictLirea In terms of their Knesnlnga^ but dots not utilise stty bk- 
pliclt nodel of the world. So provision le maie in the theory for 
deduct lin^n of Information Iwfjlielt in a aet of Sentences, 

Lamb (26) also hag (p^rDposed a atrsti f Icationa L theory of gram- 
ma r,. not yftt Implemented on 4 computeif , in which auccessivc Levels of 
atislysii are performed, ^ith a final mapping of the input Into stryc- 
tiip^^: in 3 "pememic" stratuni of the langujige. In thij scmeniic atta^ 
ttiffl ar^ bundles of "sfciiK^cs" or Ee a nl rcgs ^ ijnd Indleattfin* of the fi- 
lat Ions hip* between diff^tent bund lea. Different sentences ^ich 
aeacv the sinne thing should map Into th<3 same sCructur* tn this 3i4ni4Bl& 
BtratLitti» Sememtc stnie tyres arc thus canonical represtntations of 
meaning . 



C. Eef InltJLod, of Coherent [? i s c-outa e , 

Tbc theory of langyaga fienaratiort and analysis which ve shall 
describ* bclvw ia dealgned to handle what we call cfth^irent diBcou,rBe. 



A discourse i« a ae<]Lietice of a.enteiice3 such that the mfeSning of the 
discourse canncit be detemfilfted by interpreting each sentence inde- 
pendently ^ dlsrcgardina the other aentences in the dlncourae. The 
interpretation of each sentence nwy be dai^endfrttt on the local con- 
*:*:Kt> in the sense defined previou^t]/^ A diacourfe ia coherent if 
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it h*ri A fcMipL^te and consistent i.nt exp re C a t ion i. CaTTifil<;t<:fi'ess ijn- 
piiea that there i* ilS aubatviftg uichln the discoursE that doei not 
have soTue interpretation in Cke m*4&l u£ the altuatijQn taicig built 
by the listener. 

A listener^* dblllEy t<f Wild a u^del of s situation from a 
discpyrae is dependent on inft^ntiatlsn available to him from his gen- 
eral store of knowledgBn Thcrtf&re it is quite pea a it le for a diE- 
COU.T»« to aeem caherent to one Listener *tld W>t another. A writer, 
r««d£a& hi^ OWi vrltlng, easy feel thut he hss ge:ii&rated a caheveut 
S«qu<;nC« of sentences, but in f^ct , it 14 ittC-Oherefit to all other 
r«ad«T$i Ihl£ Le, unfortunately, not 41 t*ri OCCUrr«n?« in the gci- 
etLt££ic liter a Cure. Conversely k * listener vho is a psychiatrist 3 
for eKj^mplc, uay flcul coherence in, a Ee-quen&e o£ rem^ir^S fhld^ a 
patient *hln'ks are entirely unrelated. 

The ETDIKQSrr BySt&n utHiasea an eitpandijble store o£ geneftl 
knovledge to build n Tn^cleL o£ a situatian described in a member of 
a limited class of discourses ■ The form of this model of a a it lib t ion 
buillt ty STUHEHT will be 4liC^S3Sd In detail in a later section of 
this chapter. As far as T kn?3*f, STUCKttT ia the only computer im- 
plementation o£ a theory of discourse Analysis tWW esttant that raape 
s discourse into some repxesantation of tt« rt&anlt|g» Vhea the theo- 
rles of L*tflb- and FendeExafC sre Implemented s Ebey ehpuld also be 
able to analyS'S this clsas of discourse (and oChefS)+ Rarris filso 
talks about "discourse analysis," (20) b*it in hia uS« of thti teres 
he apeoificJlly excludes the use of mEEinins» st*ttti;ftJ 

"7b.K method [of diacDuxse aiifLlySis] la ffrnisl, depending 
only cjn ttie occurrence of moipbemcs a» di^tiiiEiiieHsble ele-. 
irients, and not -upon tbe atislyat's koowled^e of Che particular 
meaning of e^eh muifp'heme."' 
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D* The Uce of Kernel EenCeciceB in Qur CTi^oryT 

A b«fiLc postulate of out theory ef Idn^^g* 4aiLjs£B Is that 
4 littensT undeTftandB b dlscaurae by tranafomiilg It Inta an ei^ul- 
vftlont (in meaning) eequence of sliifipl«]: kernel SfintsnceB. A kernel 
acntence is one which the liatener can understand diTectly; that 
ift, one for which he knovs a transff^mfnat l»n Into his infcnaation 
atore. Conversely, a speaker gene? at ts a i^C di. kernel a ententes 
frnm his iafomiation map p and uClli£t$ 9 SftquenCe o£ trauBfcnEstions- 
on this Bet to yield hia apti^ken dtflCouree* Thta set of kernel sen- 
CenceB Ifl not Invariant frtm ptvtcn to person, and even varies fcu" a 
B Ingle Individual »ft he lc«tn«i 

T>ie ijte af l;crnel S«n£en£ea In this way is contvcveiftl^l » 
tliove^e:^^ the tbeoiry ii proposed as a eocnd framework for uniler« tending 
'Tld inplenenCitIg language processing on a t«DpLLterj net nvCcftsdrily 
d4 4 model for Imm^n behaviour. Itie uaefLilneaa o£ tll^ft theory sb a 
paychc logical model le Hn eTiipiri04t que scion. Skinner (Uiy hss 
jtven Boqe psycholoi^tcal Juat if icittion for aaaumlne the eKlstflnee of 
a set of hade s tnt cnC cb , and Chomsky {7} haa diacuaa^d the linguis- 
tic veTiCe of the U9« of the concept of kernel aetttetjcea^ Despite 
this connon cornctpt of kernel deDteaceSp in practice J Our u«e o£ 
kernel aentencei ii different then that of iSkinner or ChomBky* Our 
use of kernel tttitenCeB ts a baa la of a Iang.uage is ?nS logons to the 
U»e oi. generdCors in defining a group. 

Although Wia dfc not propoaing oi^r theory ts A baa la for a p$;y- 

cTfralogtcsl model, it has been iipefulj to avoid Circuralocutiona ^ to 
desorlbc the theory in teriBB of the properties 9ild DiCtifHiB of a hvpo-^ 
thetlo^l fipeflher oad llsten-er. All atatements about Speakera and 
llstiiOcrB should be interpreted as referring to Computer prograqia 
which respect Wcly, generate and analyze coherent discourse. 
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E. CefieraCicm of Cohartrnt DLin;i:mrSc> 

1) The Sptfdkcr'i^ Motl^;! ot th^ Vof Id. We aasiioe that a 
apeaker h^a aoine model o£ th'S ^nfotld Itt his infonn^tton it<n;v* W& 
ahaLL not be conceroect here vith hcv til is «\oAsl was buiU] «r its esC'^ 
act fonniit Different fdlfma foif Che eachdel uiLL he useful for different 
Language taak8> tut th«y muit *ll have the properties described be Low. 

The haaic component-s of the tiKH^ul ^tK a Set o£ obj&ttS, t^jj > 
a aet of funct lona {?? )» ^ ««^ '^f relfiticn^ 4 ^^ j > ^ ^^t^ of pv^'^' 
poaitiona \^ I ? ^^nd ^ set. of 4fl»aill: it dgdui^tt-vC irulei» A funCtton 
f" ia a mapping frcmi ordet^d j«Cf of n objects « Called the »rgu- 
metita of F^ , into the «et d£ objactf + The mapfiin^ nay bit multi- 
valued and ia defined only if the argLtmentS SUtisfy £ set of con- 
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diticna aaaocLated with F ■ jA ^Dnrdition it efi^antislly m^mbctsbilp 
in a cLaaa of ohjectdp hut is defined more prseiiQly belDW+ A ta- 
Laiti-Pn ft. la a apecial typ-e of object in the taodel^ and consi^t^ 
o£ a LaJbeL (a vicilque identifier) » and an. ordered set of _i| coEid it iona , 
called ttie arfsument -conditionB far the relation. FiinctioDS of re- 
lations are a^a Iri relationB. 

An elgmiefit^i'y ■pl'i&-pOiHi»B e&nalsta of b label aaaoclated vith 
siome relation, H! , ^Txi ati ordered aet of n o-bjecta aatlafying the 
argument conditione fot this relation- One aaay think of these pro- 
positions as the heliefs of S B-pcaker abfluC what relattonshipB h&- 
tween ohjeote he has notlte4 are true In the world* Comjalex pro- 
pp-Eitionj -are logics 1 tOfnbinat iona (In the tiaiial aenae) of elementary 
propoBitionE » 

th& aegflantic deduct lire rulee give procedures for adding new 
pTop-aBitaaa to the model baficd on th^ ptOpoe Lt ICiriS ii^V i-A Ch& AiOdftK 
In addition to the ordinfliry rules of logic* tbe^e rules Iftclydt a^iom^ 
about the relationships o£ the rel^gcions in the tnodel, Xlie aeoiantic 
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d&dutttvo rwUs «L30 Include links t& the senses o£ the epeaker^ For 
example, one sycb Reductive nile for adding a propostton to tht mtH^tl 
atght be (looaely speaking) "laok In ths real uorld and see If It !■ 

true." Thaae rules eisentiaLIy d^tenolne haw the m&dtl is to be eK- 
paudcd, and era the most coiiiplev part of a CQQple46 System. Hqw~ 
ev<t, £totii o^r pi^es^nt point of TfteWj we need onl^ tons id er these 
rules as a black frox vhich can extemj the set of propoeltlona ia the 
model. 

A e loaded question Is a relational label fot some R^ 4ft4 an 
ordered f ftt of n objects. The anawEr to this question ia *ffirmativa 
if tlie pr[>pDait±onj cce^sistin^ of thie Ipbel and the n objects j is 
iJi th« Bwdftl (or can be ttdd^d to it}. If the negation, of tbU pro- 
position ii in the mode] (or Can be added) » the ansver ia nesative^ 
■ptheifwiS'C the answer ig ucnde Fined. 

An o£en quest ion tionsista of a reUtlsnal label for m ti- argu- 
ment relation, R^ * and 4 set of objects cis?T«spondine to n-k of these 
srgujneiits, -where ntkil , An anawer to «n Open question 1$ in or- 
dered set of k objeetSj iuch that if these pbjects are asaociat44 
vitb the li Tinapacified urgumCnta of fi^ » the resulting proposition ia 
in th« HH>dei or oan be ictclccl to it. An, open question may have no 
fflEL?weif*j ox may have one or more BJiswexs* A. condition ia in open 
qyest ion with k=lj atid in object satisfies d condition if It la. an 
«aswcr to ttie 4]uestton» 

2) Generation of Ksmel Rei^tentea^ Ue huve d^sscribed the 
logUal properties ol the epeakerfa Htodel of the world, ffe shall 
tWMj consider hpof strings in a LflnBi]ag.t;, vorda^ phr*eeaj and sentefices* 
ar& aaaaciated with the model. Corresponding to the Set of object* 

0^ thare ia a aet 9^, of strings (in Engliah in Our tase), 
called th<S ubibeb of the objects* There is « raany^one Bftpping from 
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{H. t\ onto /o \ I- It Is many-one trecauee on« tfbjfrtt tnay liavt more 
than oiie, name, fli^i frankfuirCer and hoc dog botti map back Intu the 
sane object. In tht ptodel* 

Recall that funetlMll »flp n- Cup lea of object A lnt» ebjecta* 

Thus a funcCion naiae afid an n-Cuple can apetify an abject- We 
can dexive a narae f-or Cl^ia objecc from the £iiiuetlon n^aTii? and the 
names of iCa n^rf^uracnt^' .AssoclaCed with e<ach £uitCtion l4 at 
l^aaC one Unguis Cic fortDj a sCrlcng af wards with bLitnhs In li^Lcll 
narnea of argLnnenCB ci£ ehft futtCtioci muaC be inserted r EKampL^a. cif 

llnguiacic forma nasotlateiJ with a model are "number d£ "p 

"faCheT of ", and "the child of and "* Hitlf* 1* 

a raany-one mapping froci the s&t of Itnintatlc forma v^jj: iorto th« 
iet of funcciona. Two exiimplfti of mol-ttple Linguistic form:S for 

Che aame funcCion are: "fcithtr of " and " ""s fothar"; 

and " plua " and "the iran of and ". Thus, 

If objects K and y have names "the first number" »nd "the second 
tiufflber" and aaeoclated with th* function " * '" Is the linguistic 

iorm "the produce of .ind "* tticn the name of the object 

produt&d by applying the function " * " to iS and y is "the produce 
of the flrtt flurther atid the aecond number" h A parsing of a name 
thus imifit decompose it into the part which i» the Linguistic form, 
and the parts *^ieh sre nanea of flrguments of the correspond ina func- 
tion. We aha LI calL objects defined in terms of * function and an 
ft- tuple of objects S functionally defined object * and th034 which 
afe iKJt functionally defined «e shaLL calL Elmplfe object-t . Simple 
objecta have simple names aeid functionally defined cibject3 have 
CQimpoa it e nrnmes -r 

In addition to linguistic COEWS associated with functions, 
there are Linguistic formj& .-tsiSotiated with relations, Fot an n at- 
gtiment relation there are n hianks in the linjuisCic fotiBi Eifaaplee 
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of ruilAti^n^l linguiatic- forms a re J " equals 



«*Tf fl , to " and " speaks". It ia thla 



»et of lin^latic forma ^ corr«3]30fL<llTtg to the reLatlqn^ ifl. the model, 
that 9^Tvv 46 franiea for the k«icn«l tftntencea. 

In a manner Bimilar to the way contpoalte naittes Sta built, b 
V.6tnel ientencE correEpoiwitng to an elementary proposition is con- 
structed by insercing cxasncS torrosponding to each argument in the 
appropriate blank. NaBltS may be simple or coBip*site+ An example q.f 
3 kernel sentence for a propoaltion built ftflrt futh ;i relational 
lingiilatie fovm is "John's father gave .3 Cinw;* the aalary of Bill 
to Jack*" Whtcti Contains the almple nimt;.* "John", ".3", "Eill"> 
arid "Jack"h It containa the: functional ItnguisCic foiisis " 's 

father"* " tlmee " and "saUty of " and the r*la- 

tional linguistic form " ftave to ". 

A kernel sentence car r e i^ond Ing to a CDiopl.@x pr*p«elttoii 

i» Constructed recursively fPCm the kernel sentenc^E cors^feepond ing 
to lEl elementary proposicicngl constituents by placing thefii in ttie 
ccrtespondiug places in the ling-ulstic forms ^'' atid ", 



or ", "not ^' etc- 



^e kfiETiel tientcncd CoTrsEpondiTig to a clos*^ question ±a 
constructed from Che fcetnel of the correspcmdtrtg ptoposition by 

placing it in tht Llngui*tic form "la it trtie that 7" For 

dn open qi^ieflCi&rtj ■dcmmy objects are placed tn the open argument po- 
sit iona to cctftpletfc d propositlonal fomtio. These dummy arguments 
have names "'L?h&", "what", "where'', etc. ^ atid which dummy obJecEt are 
used depends ^n thft Conditioa on that argument i^oSition. A question 
mark ia pi seed at the end of the kernel seut^nCe constructed in 
the ^Etial way frmn the relational lin^uiSGiO form and the nam«j of 
the argument s . 
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In genera tins a c:(>b*r^nt dtscouTBe, a speaker chcmses ri nunn- 
ber of pTopo-BitlonH in hi* mijil*]! atvd/or scune open or closed ques- 
ttone. He chen uS^B linguistic inforraation associated with the Tnodcl 
to conBtTUC-t the set of k«rjl*l sentences corresponding to this *6t of 
chaaen propoa it Ions * In the tiflKt s&ctium ■tre will diHcuaa how he 
genera tea hla discourse from this S^t Qt k^rp^l^i. 

3) Transformation;;, on Kernel Scntenn^CS . The a^t of kernel 
aetiCeLicea ia the base o£ tha Coherent dis«Ouva«ih H^e m^antng of a 
k&rtiel sentence is the propositiort into which it nia^a> and aiiai- 
larlyj the meaning of any name is the object HllitK is its liMge un- 
der the mapping. To thia set of kernels we ^pply ^ »e[:{uenc« ^f 
Uteatiing preaerving trana format ions to get the fln^L ctiSCOufS&t He 
vee the word "trana forma tion^^ in its broad general Sense, not In 
Chfe narro'lf technical aenae defined by Chomaky (7). 

^ert aT« tVi} distinct typea of tvana farms t ions, at rue t Lira L emd 
definitional* A stiCuCC(jral &r eyntactic trana format ton ia only de- 
pendent Oil the «tfuCCUi:« of the kernel 3tifiLXg.<a) on which it operates. 
For example, one a>TiCacCic t tans format ton takes a kernel In the ac- 
tive voice to one in Che prtissive voice. AiWtb&r cpmbtnes two sen- 
tences into a single conrplex coordinate sentence* 

Otufi latge clese of syntactic trana format ions is uaed to sub- 
stitute: p rc.]Ti£.]miifLe I fihraaea for names. Pronominal phrased m»y be 
ordinety ptrs^flcmtis auch a a "lie"^ "she", or "it"+ They may be refer- 
ential ptive^e^ Sueh ae "the latter", "the former" or ''this qu?Jntity"* 
They may also he tfUtlcationa of a full name auc-h #s "the distimce" 
tax "the didEoAce betueen Hew York and Los Angeles" n In qHssb wher* 
siieh pronominal reference is m^de, th« Oohe-rcnc* of the final dis- 
courae la dependent on the order in which the resultant ^Cfirtga 
appear . 
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The Second type of trafiefeoitftton la <j ef Init iona 1 , It in- 
volves eubfltitutioiLS of lirtguistlc string* ^tw) fomiH for ene* ap- 
pearing in the kernel sent-cntps. For exan^le, for any ^|>^e3.r?nce of 
"2 titmee*' «e nay Substitute "ti*lcc", and for ",5 timeB" eubetttute 
"one li^lf ef"* In addition to thia atring substitution, Sdrafr trans- 
forma Cicus perform form aubstitution and rearf;ingeiiient . For example ^ 
for a JiTjrnel sentence of the fonn '^ k is _^ fliOre than z" t vhere k^ y> 
and a are amy names, out deHnltional tranSforniEition car SubsCitiite 
"k ftKteeda _e by j^." 

Stme trepafortnat'lons are opCi&nal, and soiae may te tnandatcurjf 
if Cftrtd Ln . f ormii are present in the kftfnel set. Certain trangfprma- 
ttons are used by 9 speaker for atyliutl^ purposes^ for example » 
to emphasise ceetain objects: other iyncaGtic trsfts format Ioue E^ch 
Si those which perfoma pronominal, stibstitutiona art uaed because 
they decreaae the depth of a construct loo, in the Sense deflp«ij by 
Yngwe (44) . 

Let us T«vlev the ateps In the generation of a coherent 
discourse. The speaker cht>oaes a set of propoBielthrti; , the **tdeas" 
hft wishes to transmit. He then encodes them aa Mngu?iEe strings called 
Itofnel sentences In the manner deetrlbe-d above. He then chu^aes a 
Sequence of strttttatsl end definitional tratigfyt^jtlDCLE ^ich are 
defined on this Set of kerne La or on the ordered set of sentenoea 
which result from applications of the flrat tlf^nsfD-nmationSt The 
reaiilting eequ^jnce of eentenc&g wiLl be a cotKjrent discourse to a 
listener if he knows all the definitional transformations applied* 
In addition, for every pair of ■Ustinct naocs which tbt Epea:ker mapa 
fcflck Into the sane object j the Listener myst 4lao map into a a Ingle 
object. 

In order to tlarify this theoty, we ahoVj in Appendix E, a 

sample semantic ftCMr stive grasranar irtilch will fi.«n grate coherent dls- 
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CDurie underataoddbla by the STTJDEHT finsiyeia profiTam. The ub- 
J€Ct8 are DumbcrB oiui the fucictiana *re th* air Ithnetic opefStt^iW 
of Sums dlfferencE, product and quotient ► The only reLdtion in 
the iHOdel la numeTleal ej^uaLlty. Ihe t raii3 f o naa t ions ine ttc*CFlbed 
infernally; fuTthc-r HrtBuiBtlc invest t:eaC ion la neceBsary bcfote a 
fctaal nptatlor ttrt ErartefoHmattoas can 1« dteided upon, Pjir^sllcl 
to the graoaar ia y a^Jsnplt ||>i?obLeni genera tad by utilising tlii* grflB" 
™ar» thia problem ia solvable by the STUDEHT system. 



F:r Analysis of Coherent DiacatiTBC- 

Girtevation of -cohtii-eT^C dia course consists of two diat ingutsh- 
abLe stepa* Prcm prapoSlClotva in the spe^k^r^i Bodel of ttie worlds 
he geneTlie«S an ordared set i>f ternel sentences ^ Hii then applied ^ 
sequence of tt etie f onma t ions to this kernel aet^ Tht ^esuitlnB dis- 
coiiTBE 1b a to4ed meaaage whicti ia to be analyEed and decoded by a 
lietener, O^he listener's ptobleto can be looaeLy characterized 5ie rjn 
eteeffipt to anaver the questions "What would, I hav« Di&Btit If I aai4 
thftt?" 

1^ analyze a dAscourae the listener raiat flrtd the set ot ker- 
nel sanCtncea fram which it was gon^irated ; one w*y to Ao this la 
to find a a^t of inverse trans fomuatioTtS which ^cn applied to the 
iTTput dlit^UEse yield 9 sequence of kernel sentence* ■ the llatener 
ffluat then transform those kernel 9Q.tttences to an appropriate: lep- 
Fesent«ti;i^n in his infcrmation Store. The appToprtatiinaaa of ft rsp- 
tesentaci^m La a fyiittloni of what Later use th^ llatener expects to 
make of the information conti»ln«d in the diatO'TjVse. Tlhe Listener 
may Biunjltaneously ttansform 9 ^ivif- kernel sentence into a nuraiber 
of different representations In hia infortnatlon store. On * level 
of pragmatic analyaia, ecatOPi&ntB require ortly storage of Infurmation. 
Quest ionE Snd lEtperativea require appropriate reEponse:* ftoo the 
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llatener. The difficult ies in analysis dicltotemiic Into those 
asaociaCEd with fltidirtg the kernel sentences -whieb ate the base of 
the diacDtiTBe, atwi th^ne siiotisted vitli transf»r«iing ttie kernel sen- 
tences Into represfctlCstloni In the inforraatlon 9C»r«: + 

M^tli^M? {29) haa «ug;g«-5t-Ed chat analyst a can te per formed by 
ByntheaiSf A e^Si^uenCc ai kernel sentences , dnd a 4f€quence of trans- 
fon«aCiOrt9 are chosen , «nid tbe trans foiritiat£cnJ tre applied to the ker- 
nel senC(^tk&ei+ The resulting discourse 1$ matched j^geinst the in^ut« 
If th*jr aro the BSine, these kernel sentences and transfarsiJtlons giv* 
the tr^qulred Analysis of the Input. If nirti a c^ih^iige is aadt a^ th^t 
ths Y^Sulttil^ diEcouTse becomeB mo-r^ like th4 inp-ut. 

If the kernel sentences and tranafsmiations were chosen ran- 
dcnlyd thld method would Dbviomaty b« too inefficient to work In 
any prscEical sectBe, However, by Mtililtnf^ clues within the Input 
dljCQUir^fr th« choicE of kernels and transformations can be grest ly 
resitrlctetC* Thi* technique of aentencft snslySls is being implemfcntftd 
In a prosvant ttln^ wriCCen at HITM: by Mslke-i- 4nd Eartlett (43). tht* 
technique hag the advantage that eHBcCly the S^lnrt |;,Ta[imar can be 
Utili^^ f»r both jani^ lysis and gene ration o£ diScouTse. 

A nvore direct analytical appruflth would utilize a set of in- 
V«Trse anaLjrl^tc trans format ions* If 1. U a trSns format ion that in*y 
be u9«eI %t[ gflnerating a discourse! sfnJ T.CS) ■» S, where S and S are 

seta of Sentences, then Che analytic transfonmation T. la tha ±n- 

-1 — ^ 

verse of T^ if and only if T^ (S) = S , The choice of which in- 
verse trsnafflftnJtions to apply and the OXiimv of their application 
R^y agslA hi restricted !by utilising htutletiOs conceTned vith 

£«atufeS of the input. 

Once the baae set of kernel sectt^ncLBig for a given dis- 



courae Is determined, there rcm^tii* the probtCTi of en Ee fine ref*- 
|-«$«nt:atianfi of thass' aentenceS in £hft liet6ti*V* 9 infoarnati»n store , 
Th« -MibJo? pirpbleHi in accomplishing thti ttep Involve t tht A^pat^tiiiA 
o£ tl^a^e uords which are part of lingLLlstic fotins £f?t t^\.^ti.onii ^xvi 
ttiose ^Itich are pert ot a name. This is difficult beC^ufle the j^ioe 
word ilnticQ^Taphlc symbol) may have multiple uebb in « Idn^^udgfj, 
Having EBp9rate4 the relatlcmal farm from the names uhicli repTBsent 
the 4tgununCe Off this relation^ Due can then analyze the noma in 
teHnS of CMttpMiiePts ^ich are functional llngiiiatic fomts jind otheri 
wlii'^h iti slfnj^le n^mes- From this pareing in terms of TeljItioniSl 
linguist it' fc^rni9'> f M^tc t li^i^a 1 linguistic forma and simple ndmea, the 
di^^ourse C^n be trdtidfcno&d into a canQnical representation in the 
infoirmatiDn store of the listener* 



Gw LJJtiited Dedu-ctive Wr>dela^ 

A eompliCe underatand tn^ of « dliCOurse by a listener would 
imply that the repreaentatton of tha dliC(!!urse itt Ma information 
stota £,S easetitially is amorphic to the ipe^kft^S ui&del of the world, 
at la»*t iot Che. universe of dia course* The listener's repreaen Ca- 
tion must ptQ serve all inforraatlon Implicit in the <lLscouTSei 

Xf the liatenetf is Otlly Ititerested in certain aspecta of the 
discotiTEe, he need, only plfftStS^e in forma C ion relevant to hia interest 
tiiA ^isoard the rest. WithLrt hi* a^ea af iutereat the lietener'a mod- 
el is Isomorphic to the specter's- model in the aense that all rele- 
TTint ^tdLictlons which mti be made hj^ Che speaker on the basis of the 
dls Course can also be made by the lleCener. Outside this area of 
interest t the liatener will be unable to answer any questions* tfe 
call such TeBtrieted. in f OiJ^na C ion stores limited dc<Ju&tlv^ unodela* 

Hit question- answering programs o£ LtfldaBy aud Rap-hael, and 
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the STUDETIT sysCeoi, ?iLL utiLl** Umited deducCive tttoiels. For tK* 
area o-f Irtt&foBC in each of thise prograniiB thiife wa^ rj "naturaL" 
TepreEeiiCaticHl for the inf orwiatJ-On In the allciva1>l« input. I'heae 
repreaentation* -weTE natural In that they factLitatcd tha deducti&n 
of implicit information. For escain^lc, Lindsay's tamily traa rep- 
resentation made it easy to compute the relationship of any two irt- 
diiridnals tn the tree, ind^petwJ&nt of the ntrnter 15 f sentence e fi^e.- 
BBsavy to build the tree. 

B6C^u£.e the Dumber of if <] l^a t iona and f unct l.&n« e^cpressible 

in tht mixliil* in all three aj^Jlt^enrt la very limit ed, there is a 
corrta ponding limitation on tbc number of lingniitit forma that may 
apptar in the input. TVtla igreatly eimpllflei Che parsing problem 
diitusse'd ear Her J by restricting alternatives for words in the 
input text. 



The STUDEHT eyate.-fi is fln implementattofi c£ the analytic por- 
ttcn o£ our theory. STlfDOTT performs certato Inverae trans foTtna t ionS 
to obtain a set of kertiel s&ntentca and then tTfansfooiE these Ufefrtftl 
sentemcet to eKpresEiocia in a limited deduct iv* model. Utlllglng 
the power of this dedu.etivc! model, within its limited domain of under- 
atandtng, it is able to answer queation? bas^d pn information im- 
plicit irt the input Infanaatto-n^ 

the fl nil lytic and trans formJltional tecbnlquea utilised in 
STWESt are described in detail in Qiapter 1?* He shall describ-a 
here the canonital representation of objects, rtlatiotta and fun*- 
tlotia wlehln the model. STUDENT is reatricec^ to anawarlrtg questions 
framed la tht Context of algebra stoty probleEs . Algebraic equa- 
tiona are a ttatutal representation ffiir in forma C Ion Itt the Input. 



37 



HiB objects in the niodftl are numbeTB i CX mmbsTB with 8ti ae- 
acpciated diinrnaian. The only relation in tho (W^el ia equ*lit]f> and 
the only f unCt lona repreaented directly in ch* w&del are the ^rlttfc- 
mette operatloriS of addition, n«;gat:loirL, nrnlt Ipl Itationj division 
and eKponentiJitiOfi, Other functions are defined In terms of thee a 
bd^ic functions, by c<3B4poEticn, iJnd/ftr aubatltutlon o£ conatentB 
for flTEumenCfl c£ the-*e functions* For example, the opufatlon of 
S^uafitig iE dEfine4 aa exponent istioil vlth "2" as th» gecond argu- 
Bent of the exp&n&ntial functlc^n; syhtraction is a eompoaition of 
addition and neg$tl»n. 

Within the cemptiter, a parisnEhealzed prefli notation Is used 
for a standard pepreaentatlon of the equations Impltctt In the En- 
glish input. Th* ertthmetic op^JtaClon to be expredfled £s made the 
first element o£ a list, and the steunie^rtta of the funCtiotl are auc- 
ceeding li-et «l«penta. The Exatt notation is given in Fle.*ii:e 2 below* 



ajualtty 


Infix Mcta 


,tton 


FretlK Hfttacion 


A « B 




(EQUAL A B> 


Additidtl 


A + B 

A + E + C 




(PLUS A S) 
(FLnS A B C> 


!Heg3t ion 


- A 




CHI9US A) 


Subtraction 


A - 3 




fFLUS A (MTHUS B)) 


Mult Ip Heat Ion 


A * B 

A * B * C 




<IIMES A E> 
{TIMES A B C) 


Division 


A / B 




(qUOTIEHT A B> 


Exponent iat ion 


A^ 




(EKFI A B) 


FiEUTQ 2: Notation tfichtn the STtiDEHT 


Deduct Ivt Hod el 



In the figwirtj. At E> ^md C are any reprcscntaCloEiB of objects in the 

niodcU either composite or simple nffliw* , The uaual in £ Lie notation for 
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these fLuactianel express iona is given far c cicnp a r le an « E«c^m9v th^d 
1b e fully parenthefiiz,ed notationp tui ambiguity if( CpC^r^tional ordlEr 
arises > as It does, far example , for the LinparenCh^giitd iflflK not^' 
tlon e-xpreasian A^BKl or its corTeaponding natural Ismgua^S ^Kpl^^j- 
a Ion "A ticnea B plus C". tlote alsQ that in Chia prefix notation t^lm 
and tlmga are not strictly binary operators. Indeed^ in the tn^dcl 
they t^ay have any finite number of STguuentaj t.g^ {^^iiE& A E C D) 
1b a legitimate eiLpressiQn in th^ STlXDli^T ncHielT 

Represent at ions of objects in the BTUOtlfT deductible nodcl 
are tdken from the input . Any string of words not contaiAinjt a 
Ilnguiatic form associated with the arlthcnetlc functic^ns expr^^jible 
In Che model are eong idered siutpl^ nau&s fur objects- p Ttiua , "fcha age 
of the child of John and J'afle" iu co-tteldtrftd s gimpLc u^mB bflcauEe it 
contains no functional linggtlstli; f^TmS associated wtth funCttons rep- 
resented in STU&EFT'fl limited d^duttlve modtK In a more i^Bnerlil 
■odel it v&ii^ld be con»idered a i^vxpositE nSme* «nd the funtstional 

forms "sge of " and "child oi titd " wauld be 

aspp«d inta thtlr COtrc«pondiii£ functions in tbe model. 

Because Such CQinpicK Attings $re c^onsidered simple namea in 
Ch* moduli and object e ^te dlEtinguished only by their name-B , it 
If luipottant to determine wh'en two distin&t niimes actually refer to 
tht ism« object 1 In f^cC, answers to questions in the STITDEHT ays- 
tern are statements Oi the identity of the object referenced by two 
names « Uouever » one o£ th« nsract (ttie desired one) must Eatiafy 
certain leKicaL conditions. Host often t:hl£ condition ia JuaC that 
the naiite he a numeral, ^ex a ntott general tiodeL this restriction 
could be stated as requiring a simple name forrespondins to aome 
functions LL^ defined n^me — beciluse^ for example, "number of *"' 

would be a. functional litLgviittic foiTm J,n the general mcdel, and Che 
only simple name for auch an object would be the numeral corras- 
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pE^ndlng to tttia nmnber^ An answer consists of a Btstsment of 
Identity «-S- ^'THi^ number of customeTS Tdq get a is 162." 

Tfie other Lej:Lcal restriction on ansve-rs a omet Ijnes used In 
the STV)]']JNT ^y^tera £s In^iEtence that e certain unit {correa- 
ponditi^ to £i dimension associated with a niiiober) appear Id. the de- 
sired aiiswer . For example, epana ia the iiiiit apeclfled hy the ques- 
tion '^ov lattny spans equals 1 fathom?", an-d the answer given by 
SITX&SNT is "1 fathom iB B Bpana". 

Hie deductive OKidel deacribed here: is useful for «n£wering 
il^iiSSti-Otis because we kciciw how to eKtxact Implicit Informatloii froia 
ejipgrasBicKiiB In thia model; that ia, ve know ho^ to solve sets of 
algebraic equations to find nuiaerical v^ilues wht^h ^^tlsiy thc^c 
eqnationa. The solution process used in S-TUDENT i$ ieSitihd in de- 
tail in Chapter VI. The transformation prQQ.eas, b^fted on the theory 
deacribad earlier, which ETUDiKNT uses tD g[> from ^a English input 
to thia deductive loodel, is described in Chapter IV« 
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CiiA^TfeR III; FHCGKATMIMC FIQBHftLISWg AND LAKGiJASE MAHIPUTJ^TIQN 



Jilmost sny profirarreaiftg languas* U unlveiraal in the aense that 
with enough ttm*, *psce , atid work at the Imp Lement a t ion , any eomputablt 
*1±nttlr>n may be fT-pgriiminfid . Bowever, Ch* task of programming can be 
aiaJe much easier by the proper choice o;E rj higher level prolslem ori- 
etitciJ pro^raniiBlcie hmguage. The data tc te Mnlpul^ted by tHe STU- 
tlBNT system la symbolic, and cuf indefinite length and coapleiticy, For 
this reflsoDj a liit-procesainig IringLiagfe wj^s the rnii»t appropriate type 
Of programming for this ta*k. There st^ •> nujnber ef »ucli lajigu-igeB 
available, each having its t?i^ set of aJvantagea at*d n^iaadvaritSges. 
Ftor * deecri|tEi«^ of the g^ner^ 1 properties of list-proceee ing Lan- 
gudgea, with a d^t.-^iled ci^psrison of four of the IwjtteT krKrtJn Liat- 
pr&ticSBing UngMflftcs, see B^abrov and Raphiicl (5). Mc^stly because I 
ktieH it go weLL, I chose LIS? (Jl> aa Che basic language for the STU- 
DENT System. 



ITBLVe 



The LISP foifniali.s[B ia vtiy Qoavsnt^ftt for pragrarrniing recui 
taake such a a the reiving of a j^t of ElftoltSneoua eiciustions. However, 
LISr does not prwWe any natutal mechanisms for representing m^nipala- 
tleifl e£ stTingE of tcigliah words, r^nothet very iojpottint Bubtask in 
the STUDEHT system. For this type of manipuUtion onis would like to 
perform ;i sequence cji steps involving opera tiona auth sa recognising 
a senttnte fornat *hith fits a particular pactert-L, finding ttrtein el,,^ 
menca in a sentence by their content, reafranging a string of wrda , 
deleting, inserting, and duplicating part? pf strins*, and oth*rs. 

The LLSP f^rmilism caonpt easily express such string manipula- 
Ciona, chough fiaeh could be individually programuicd. Houevtfr, g for- 
malism fair ju*t this s«rt of Mnlpttlation ia the basis of tlie COMIT (45) 
prcfgr^Traning ayatcm. Rules In this fornialisB cati ftseily fstpress vefy 
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CwmplfiK String mantpulationE, and asf& easy to rcfid af\d write. Row- 
svei'j CQKIT ana LISP cantiaC be ufitHi b imu It aneous I >» , and the problem 
context tiacagsit^tfti filing back 8nd forth tetwefln LIS'P- oriented tfrsk* 
^wl OOMIT- oriented taeka. There foir«H J adapted ctie- OOHIT rule notiJ- 
tlon for use In LlSPi and conatruCttd a LISP progfflrti caLLed METEOR which 
would interpret Strtitg trans forma cicti fiileB in ttil* netation. 

In construct ittft the METEOR interpreter, I effcCtlvtly ea:Cen.'de4 
th« iloqiience o£ th* LISP proEraniinirt£ Lansuege; that ia » operations 
HJlitli could be <lont prfeviouBly, but were awkwUrd to Itmuoke couLd now 
be Mtpreaaed esieily* An eit ended laaia^aag* embodyinS tHe best feature* 
of OOMlt and LIHP ceuJd have been built fvam acrStctij. but it is much 
more ec&noHiicaL to aebieve such eKtflnslfltiE by embedding^ "Ehe advan- 
tages and disadTfiintflg*3 of lanEuige CKCetiaion by embudtltttg are discussed 
in detail by Bobrofw atld Weizenbauiii {^) ^ 



k. BpL-i^SCylua ^ Eesirfed String Forntft. 

KETEOR ha a been described in d^3Call tlaswhere C3)h but we in- 
clude her* fl brief suranary cf its fe^tuftfl^ We do this twcauee uae of 
the notation laakes IjSC'Et explication pF tUe trans forma tl*n proceaa 
eaeieT. In addition, if any arabiguUy beconiee apparent In the expl*- 
nation of the operation of aTUDEMT, it aay be resolved by consulting 
the llatlrtS of the STUWWT progrrtm £n Appendix B* Ifl this latter 
case, it way be necessary to consult the mote complete specification 
of METEOR referencfld above. 

A KETESOIL prografli consist* of a sequence of fulea each specifytnig. 
a BtTinfi transformation and giving BCime control irt forma t ion , Let m4 
first consiqlcr how a atring tea na form? tioo la specified, W4 shall 
call the (trine to be tracia formed the workspace . The norkspace will 
be transformed by a rule only If tt matches a pattern or f-onnat given 
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In the "Left half'^ of th* ri^ilc This Left hsLf ts a liat of ele- 
mentary patterns which specifies a lequence of Items that muSC be 
raatctied In the warkspace. For exaqi[>le. If the Left half wer* 
'■'(THE &0Y>" than a (tLatch *rauld be foaiud onLy if the 4nai:k«p«ce con- 
tained a "tl'lfc" itwiedtately followed by "BOV' . Irt addition to 
known c o na t it ueftt a j. one e^n iriatch unknown c£^netit^^ente + The eLe- 
inent |1 in a l^ft Half wLLL mitch any one workspace tpnstituent. ^e 
left half "(f^ $1 B $2 C>" will match a contiguous Bubatring of the 
workspace which t&nsisfcs ttf en A foLLowed by eKsctly one conatituent 
(specified by the marker "$1") foLLowed by a B followed by eacaetLy Z 
constituent* Cntatohing the "^Z") foLLowed by an occnrrence of a ^, 
Thus $1 %till mdceh dn element of the uovlc3pd^« with a specif ii^d tc<n- 
CHCC+ l£ 4: left half would raetcb Q^re thain one subetring Iti Che 
tf^rk^pid^e^ the left-meat auch substring 1,8 the ono found by the 
Hatching i>rscea£. 

We have dtscustod elementaTy patt&rtis which roitch a fijted cvuhTI- 
bar of iitikntvwn tonStituents (e.g., "$3" mattLhcS 3 unknown conetitu- 
ents). HF;TK>R aleo ha* an eLementary pattern clement "$** which 
Qiatchea art afbiti'ary numher of unknown co-nst ituenta. For eKatttples 
the left half {the $ soy) will match a substring of the WDi-lispace 
which tt^cis Kich an occurrence of "tHE" followed by any number of con- 
st itwent* (itl«tuding zero) foLLowed by an occurrence of "BOY" ► It 
woyldj for example, niEitch a aubatring ef the ^-ork space "(ClVE THE 
{jOOD BOY)" or of the workapace "(THE BOY HEEE)" . If the left 
half ($ GLITCH ?3) natchea a a wbs t n' t fig of the workapflcei then the 
elfetieiitaty pattern "^^ matches the Subatring ftooi the beginning of 
the W&rkepiSCe up to but net Includlcig the first occur rente of '"'GLITiCH*'i 
thft l^attern "C3LITCH" matches thia occurrence of "GLITCM" in the work- 
ipatej and the eLeraentary pattern "$3" matches the 3 alementa or 
constituents of the workspace immicd i a t e 1 y foLLowing ^IrlTCH- 
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EleiDenta tn th« worktpsce nwy be lig^v4 «f stibscrtpt&d to in- 
dicate special ^iTiapi^irtles of this e lenient:; for cx^inple^ one fnlsbt 
have (HAVE/VEHB) <iif (BQ¥/N01JM) as elements of tbt wopkapac*. Such 
ei«ai4ACt can ba saCched by nanie (using HAVE yi^ BPY as pattern. elementB) , 
or IdfttlClfifid juEC by their BubscriptS {or by both}. Xhe elementary 
pattern C$l/VERB) will match any single t&n&tttueat which 1b b verb; 
th»t is* ane which has the Bahseti[>t "V6RE'\ even if this conEtituent 
hss othtr sitbaeripta. Thua the left half (AlFHBU (Sl/VERB) BOOKS) 
will match tHe aubatring (ALFRED f READS/VKHfi) BOOKS) in the workc- 
Spac?^ {riijAi ALFRED (READS Af ERE) BOOKS tH THE LWRARV) . 

Other eLementary piittern ctem«rit3 are provided » and new pat- 
tern eLiftTieots can be defined and iss^lly used within the METEOR -Eyetem. 



U« have discuBEed how a ^e-itred f^Efoat can be specif isd through 
a prototSfpe pattern, called ^ left half, H we try to isatch the work- 
space to a l&ft hfllf> but it is not in the it?riaBt epecifiedf we aay 
the matth has failed . If a aubstrlnj of th< inockapace ia in the speci- 
fied format* the Tjuatch ia auccessfulf Hhefl there ia a auccesafuL 
match ► wc mfly wtah to transform or mfltiipulate the aubatring matched, 
OT place in a temporary storage lescetlon, called a shelf, Copied of 
segnwnta ef the matching EubsErinft. We aha 11 now discuss the nota- 
tion umd for specifying Such t I'ene forma t iona , and storage of nmteri^l. 

A left half is it aeijuB-ntt of elementary patterns, and we ;JEEoei- 
Ite with each elementary pattern a number tndtcatinE its positl&n lit 
tbia left helf sequente* for example, In the left bale (§2 E» $ E) , 
the flriSt elementacy pattern. Si, would b* a^aeciated with the tiumber 
1, the aecoftdj D, with 2, 5 with 3t and £ with 4. If a match is suc- 
eeaafiil, each elementary fattem element in the left h«:i 1 f mattticS a 
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part g£ Che aubatrlng of the ui^lfkflpac-e BnatcKed by tht^ left half. The 
part matched b^ an elementary pdccern can then fee referenced by the 
fliirribEr asGoclat^HiI with thia <: lemjentary paecem. For the left half 
given fl-bove, and the Korkapate U B C D & A E Gj , thf left-half niaeth 
Qtittceds, and the subBtrlng (B C) may then bs reffirent&d with the numr 
btr 1. the atibstirini; (D) bj^ i, (E A) by 3, ^nd (E) by 4. 

The trafts formed worksfiace is apeclifled by the "rlglit half*' 

Of a HETEOfi if ale* Thia riglit h^lf nay hi juat the flamcri.! p, In 
Whtch case the watched pttrtlon of the workSpEice ia deleted. Othet- 
l#lSe this right half niiat W a Ltat of elernenCa ap^eifying a reflace- 
itient for the matched aubaCrlnS^ Any numbcts in Chla fight -ha If li«t 
reference (apccity) the appropriate part of the niaCchcd e uba t r img . 
Other itama in the list may reference tliema elves, or strings in teifl- 
porary storage ^ or functions of nuy refer ^nc^abLe substrings. In 
the example dlecossed above ^ if the right half were <3 2 M 2 H) , Chtn 
the mdtehed po^rtitm of the uorksp^^e would be replaced by ^B A D M D H) , 
and the workspace would become (A B A D M & H G) . Note that 1 and 4 
were not mentioned in Chla rigltt tuiLf and were therefort deleted from 
tht worfcapace. AUo 3 and Z Were in rever*e order, and thua thege 
re£*renced part* isrere incerC^d in the worke^aee in an order opposite 
to Ch*t in which they had appeared- 2 la referenced twice in thie right 
half and therefore ttro copies ^i this ref&rented aubatring, "(D>'^ ap- 
pear in Che iwrkapste. The eiemenCc M aeid H In thia right ti.iif refef- 
eBce only thecaeelvM, and are therefore lna4rced directly into the 
u»]^(^£piice. 

Uaing Che right-half ^lanEnta deacribed> that U, numbers 
reearenclng fliatthcd aubstrtnga ^jnd conatflnts (elements referencing 
thcmselveajj one can eKpre«S trflnEfonaationB of the *?cfrfcapace in 
which clement a liave been added to^ deleted froCTpdiiplicated in, and 
rearrartficd in the workspgce, Elemcrts to be added to the Workspace 
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thus far can imly be= constants* Let us consKJer aooie other possthLe 
right-haLE element* » They are all ifidii:ate4 by Lists which start with 
spec id L fLass. 

Ttie contents 6£ sny ahelf {t*^iporsry *toca6e= Liat) tan be 
Tcfet^tie&d by a two element List with first cli^ent either *A (£or All) 
or *H (f^>r Sent), SAd a second elemenC, the shelf name. For eKflmple, 
<*A E-qfT) reference* the entire eofltenta of 9 9bel£ naTiued EQT. If thia 
element appeared in * pislit half, til* entire cont«fiCa of that shelf 
would be placed in th* correeponditie place in th« voiftEpaca, Tbe 
first element of a «!]«lf named 3E1TTENCKS could b* ptit into the work- 
space by uiiciB the elSflifetlt (.*N 3ERIEKCBS) in * riaJtt half. 

Thft flaa FS 3B the first memhet df a list ftrvina as « righE:* 
half elemtnt indicate* tisat the ncKt BerBber dJE thU list is a funetiQ-n 
itauie, and the following OrtSa are Che argimients of thU function* The 
Tfaliie of the function for this set t>f argu-ments is placed in the 
^rkspace. In Eh la way, any tl&P fiinttiort can he uScd within a METEOR 
tule- 

The flag *K indicate* that the r«aC of the li«t following i« t* 
be evaLugted a* a right-half rule, iSftd then la to he "conpress^jd" 
irtto a list which will bnJ a ainglB clesnent of the wiifkapace. TtruSj 
chunks which are longer, and have more complex structure than S 
single word cati be treated ae a aln^le Mnit within, the METEOR 
workspace string. The inverse opertftten la the expansion of a cbunM 
BO that all Iti components appear as individual const tt(?ents in the 
wOfttepace, EKpansion is indicated by a *e flag Bt the beginning of 
« rtght-hfllf eleaient list. 

Wp have thus far d la cussed how the t]fafl*fCirr»ation of a at ring, 
called the workspace, can be eKpressed In terma of a left half whUh 
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is a pattern foc a <l«SLrcd input format^ and * rigjit half which la a 

patteiffi tax the desired out put fonnflt+ There is no te^So-n to limit to 
on^ tbe fiimbet of outpuCe fiMi a single left half natch » In facCp a 
third auction oi o METEOR rvU, tailed the "routins **Ct ton" (for 
historical rtatoniE) , alloua th« pfOgramncr to give anv number of oth- 
er right halves, ond place th«3« referenced Liata aC the beginninB or 
end a£ any ihtlf {temporary storage 14(t) * The aCOragt of such a 
■'^TtfiJiC half" is i™3it?Jt6d in the towtiTlg ie-ction by a list f tenting 
with a *S ct a *iQ, followed by th* shelf name, and fo-llow^^ by a 
right hal£ paEtetn- The *£ indicates that the referenced raacerial ta 
to be Stored en tha beginning of thft named shelf. *Q ifldtcaccs Chat 
it ahouLd be <^u«ued on the end of the sheLf * Used with a *N tat re- 
trie vai^ a ahelf buiU up by a *S ia a puahdcwn List, <fl last- In- 
£iTBt-out liat>> and a *helf built up by a *q is a queue (first-in- 
first-ouC 11a t^ . 

The o-tily ^thcf significant featw.re *f a KETEOR proEram thai we 
h#ve not yet t<niflhtd &ii is the control structure in s aeC of r files* 
A METBOft rule ha^ a naine^ and haa a "gc-to" section. Ordinarily, if 
the left-half aaech f^ils, control i* automOCically pasaed tfl the 
next rule in se<)u«nCQ . If the left-half match succeeds, th* tight half 
•iHi routing secticnj are inCerpretedj and then control is pa^^^d to 
the rule named iio the "e,o-Eo". Hoveverj by insertion of a "*" 
iimediateLji afttP th6 rule name in tht rule, the method of transfer of 
controL ia awitched , and only on left-half failure will control, piiss 
to the rule named in the "go- to". 

Routing tontrol can also be changed by a list *f th* form 
"(*D namel nacieZ)" in the routing section o£ » rule. J^ftes this Hat 
is interpreted, any ^teurrence of namel tn t ''go-to" will be inter- 
preted «s a ^'go-to" CAi^tainlng "n€me2". Tht* Utter feature a Utjws 
easy return from subToutinss. The uae of Left-hal,f success or failure 
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TJE ^ iwltt^h f4r t^^ transfer of contra L makea It poseibl^ ta write sig- 
ntlleent onti ruU l»»^i- 

A HET^Of!. ^i"[>graffl is a sequence (iist) of tuI€b. Eich rule is 
9 llct 0f Li|> Co six elements . The followLn^ is <in example' o£ -^ HETEOR 
rale cant sin ing all slie elements: 

(MAME + C$ B0V) (2 L) C / C*S SI ^ 2) (*D Pi P2)) PL) 

ff< ihsll briefly f^TH^iew the function of each of these six elements ^ 
Th6 First tl^m^nt of a MtLiTEOtt rule is a lunnE: , iiTid must be preseat 
In anjr rul^+ If no name is needed p the dunnay name "*" csxt he used r 
Thft eet&nd elefflftrtt is a "*" and is optional. When it is pres ent it 
reversed thv- s-wL^dh c^n fli^w of control, and Cransfer of control to the 
CuLe nartl^i in tl** "go- to" is made on left- ha If f^ilurg ^ 

Ilie third element is ma;nda:tory, and i£ a left-half pattern 
iwhit;h is tp be matched in the workspace. "Rie fourth element is 
Dptidnal^ And is a right- ha If pattern specifying the result in the 
workspace of the string trans, format ion ds^iT^cId Thfi fifth (optional) 
«l$iti«rtt is called the routing section, nini is a liit fjsg^^d t^ith 
a "/" ^9- ^ fii^st element. The renijiinder o£ th« tcuting S&ttit>n i& 3. 
^^qtf^nce oC lists which specify operotion^S >^lch pls^Lc It^^S <f<i 
shelves or set "go- to" values. The final ^l^jm^nt is tailed tht: "Sf*- 
to" and specifies where control is to b^ pamd if a mstdn su^c^^^s 
(In th<i normal case>. A "*" in thi* position «p«cifief thft ne^xt rule 
in sequence. 



In this <;hai>t^Yj, we have briefly sumLarixed th« f^atui-e* *f a 
L»n$vage for string manipulation which has been embedded (by bpildlng 
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the MfiTEOE interpr^tujr) in Che gttvera L LUt-proceBsing Linguafie LISP. 
The aMlicy to descrtte easily in METBOK the strine transfornLatton* 
needed to proceee EngUsh sentences, atid also use, wtiefe appropriate. 
the fuiictLonal notatl™ of the gcnflTal llaC-processt?^ language, LISF, 
iraa a great sdvanCa^t in the prfjgramming effort involvt^d in this study. 

A3 a final ±Llu*tratioii of the power of the coHiblned MElBOR-LISP 
LangLiagv, we include s program iat Wang'e aLgorithm for proving 
theor-siis in the propositiona L taLculus. This aLgerlthm ia dtitrlbed 
en pages «-45 o§ the LISF «ana;,L (31), ^nd a USP prosram tot Che aU 
aorttha appear a on pages 43-50. FigLirc 3 below containa the CompleCe 
METEOR program for the algorithm, incliiding deflniticna of four 
Binall auxiliary LISP functiotia uAed witMn the HlT^R profiratB. 

Irt ^ddiciacij the figure tontaina a ttace of the program as it 
provea tb* theorem given after the first lltie containing *' {IlimBEH)*' . 
nie othe^r lines give Che Cheonsms that aire proven by the alaoeichm a a 
steps in the proof of this Cheorem. This METEOIi. program compares 
quite favorably iti bwtti sise atid ynderatand^bility to the one given in 
the LISP TT,anu;.I. and tfl the one OOHIT program which I have aeen vhlch 
perforaas the Hang al^^rithm. 
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CHAFTEft IV: TKAMab-tlliHATlQN OF ENGLISH IQ THE 5TTJDE3!fT PEDuCrrVK MODEL 



The SnfOEHT systmi coiulctt of twio Tai\±n. aubprasrams^ Cdll«4 
STUBEHT and HKHRHfiEEl* The progrim called EEKEMfiER acetpti ind pro- 
cesses sCatemencs 'w'hith tont^itn global inforaati^n; th^t is, in- 
fomLatlon ■whicli i^ iwt Specific to any one Eto-ry prolyl em* We shall 
discuss the praceasiftg snd Information etorage tecViflli^uea used 
in ROEMBER in th* neitt thapter* A listing of the 3 lot*! infornw- 
t±on Stvea, to th4 STUDEMT ayatem may be fo^ittd In Append lk C. 

In this chapter, we ahall describe Che techniques embedded ttl 
ehe STUDEHT progT^m which are uaed tO tlfanifoniQ an English statemertt 
of ^n ilgebra story problen ta estpriesttonS in Che STUCEWr deductive 
HOdel* By implication we are alsi> deftninj the aubaet of English 
1«hlcli ia ^^understood" by the STIFDEHT ptogiramd A iBore explictt des- 
CTlptLom of this input language i.9 given at the end of thft tbiapteri 



A. Out line of the Qp^gattan cf 5TIII>EHTt 

fo pri'i-uide ptripective by which to view the deCiailed heuristit 
technique B need in the STUDENT program, we shall first give aa out- 
line of the operation of th* STITOEHT program when ^iv«n a problem to 
solve. Thia outline i» a vttbJlL (JJescription of the flow chart of 
the progran fourwi in Append in A. 

SnroEHT is eiked to aolvc a particular t>rohlcm* Wc ussume that 
all neceaaatry global in formation has been Etor&d previouitty- STUDEHl" 
will now trans form che Engliah input stateisenit of this problem into 
expressions in it* limited deductive model, stld through Appropriate 
deductive procedures attcnpt to find! a solution. More specifically, 
STUDEHT finds the kernel aentences oS ths input discourse, and trans- 
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forme &ht5 sefliience af kerne Ls Into a Est of simultaneous equations j 
keeping ^ list of the anawers required, S Hst yf tJm unite linuoLved 
in tliE pTotlBBfl (e^gni dollars K pounds) Snd a list yf all t>ift variables 
(aiuple n^cnss) in tlite equations. Then SrUDEHT LnvylitiS Che SOLVK prosJfani 
to flolvE Ctiis Set sf equations for the desired unknowns* If a soIm- 
tion 1b fmind, STUDEWT prints the values of the unkfto^rta requested in 
B fixed fDrmSt* SuJ^St ittitinfi in "(v^ri^ble IS v^Itjp)" Cbe apprapifiate 
phxaoea for ygiriabl^; aflft value. If a solution cJnnot bft f ourttf j 
varioua heuriBtltS af* i^^ed to identify two vaTiablcs (!»*' fiiwl two- 
sHftlitly different phrajc^ tVist refer to the same object in the nicKlf]) + 
ti t«o variables, A AwA B^ are identified, the equation A ■ B is added 
t« the aet of equation* h In addition, the store of glob:.il infornia t i&n 
is Sneairclied to find any enguati&uS tliat nay be useful in finding the iela- 
tl&n t9 this problenj. STETDElfT pfirtta oat any asaumptions it mskes ^botit 
the tdedttty of twa Tari?ible9* and alst« any equations tbdt it retficvti* 
because it thltike Cliey may be relev-Jnti If th* wse of global equa- 
tions or flquatioAS from identtf icaCions Icsds to a solution^ the an- 
swer a dre ptiiite4 Out in the foroiet des (bribed ab4ve- 

If a aolutlon was fltSt found, ami certain ii3irj«nS are pTesent in 

the problesn {e result of a definitional trans forniatitfl ysed in the 
generation of the prdbltia) ? a aubatitutlon is made for each of Cheae 
idioma in turn andl the t f a na f orma t ion and Solutl&tt process ia re- 
peated. If the substltuttona for these idiDiTiE dp flOt enable Che prob- 
l«n to be Holved by 5TUl?BMT> then STUDENT requests additional informa- 
tioci froiB the quest icmer, showing him the vStisbl** being uaed in the 
problem. If any infctmatioai is given, STUDENT tti«* to aolve the prob- 
lem again. If none is given, it reporta its inability to aolve this 
problan and terminates . Xf the problem 1* ever aolvfrdn the solution 
Is printed and the pri?grgni tenninate*.. 
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"Si- Cjite^orieE r>f Words in a Trans format ion , 

The words and phr-ss^e (Btringa of wiirds} In thp English input 
C*n be clissifiecl into thrse diatinct catesppi-** On the bssis of how 
they yre ti^indled in the tranafo-raiat ion tC tln« (J*iliiecive model. The 
first category consists of Btrictgs &f 'wptds which nam-e ohJiecEa In th« 
■odel; 1 call such strings, varla^lfcs . Variables are idenEified otily 
by the atring of words in theoii aiw^ if two strings differ at allj they 
define distinct vattflliU'ea . One- important problem considered be lew 
is how to detei^ia* wh^ti two diatintt variables refer to the same ob- 
ject. 

Tl'ift eecfsifid cV^jS o£ words ;ind phrases are uhat I call "Substitu' 
tore*'- Each sulist ttuCor may be replaced by anoCher SCring+ Some sub- 
sC it lit ions are maiiiila tg P y ; ■others ^re optional and are only made if the 
probleoi cannot be «alv«d without such substttuEionB. Aci example of 
s matwiatory s iib^ 1 1 ty C ion is "2 times" for the ward "twice" + "TwitE" 
always meflrtS "2 ttnuG*" in the context of the model, agid therefore this 
Substitution it m4ndaCory. One optional ^'' id iobaa t ic " substitution is 
"twice the *um of tbs length and width qf the eecCangle" for "the peri- 
mtCar of the teecangle". The use of these Substitutions in the trana- 
f otTTifiit ion process ia diacuaaed below. The^e Substitutions are inversee 
Qif definitions! Cr ana farmat ions as defined In Chapter II « 

Members of Che third eiaas of uofdls indicate the presence of 
functional linguiatic forma which represent funttiona In the dediictlvt 
model. I call member a of t ttis- thttd cL^«& "operators". Opera tf? If a 
may indicate operations which ate Camp lew coinbinationa of the b^sic 
functions of the deductive model. One simple operator is the ?J«i*d 
"plus^\ Hhlcb IctdlcatuS th^it the objects named by the tWO variables 
surrounding Ic are C& bt sddadr An eKample of a more ctmplwK operator 
is the phraee "percertt lesn than"t *s in "10 percent less than the 
PiBifked ptiee", which indicates that the fiuwber inmedt*tely preceding 
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the "percent" is to be aubtractetl from 100 j thig reauLt divided by 100, 
^Tnd then this q.!iottent multiplied by ChG variable foLLowins the "chan"* 

Operator* w^y be classified according to wher« Eheic a r ELunent a 
are found* A prefix aperator , Buch as "the »i|uaire of " prscedea 

Its argument ► An opePfltdir like " . ^pertent" 1* a auffix ti-p-eir a t or ^ 

and foLlcws it* argurneriC ItiftK operators auch as "■ > , , .plus , *. ^^*^ 

or " less than i,. p." appear between their two arguEMfiES . In a 

split prefui. opfltatot inch a? "difference between- ■ i , laftj , . , . ."j. 
part of the opetaC(»r pr««ed^;is and part appears betMeen ths tVO 

argunenta. "Ihe auio o£ and < < 4 + +and, , . . ." ie a aplit prefix 

apeiffitair with an indefinite nuHiber of ars«ments. 

Sottie worde may act as ope rji tore tejfld 1 1 lofla 1 1 y j depending on 
this it C«tlteMt- Fox eKaraple, "of" ia equivalent to "^tlmea" if there 
is « ftattlwi iBBtediately preceding It; e*^. * "-5 of the profit" la 
equiviilenit t* 'N5 tlaea the profit"; however, "Qu^en of EnsT-^nd" 
does not imply S nTulti|>liefltive relationship between the Queen and 
her country. 



e;. •TJ^atiefg-rmationr'il Vio^e^ist-t^S' 

Let LIS now consider in <letail the trana format ion protttdurt u5ed 
fry STODWTt aid aee how th*(e different categories of phrssc* Inteeaet . 
l!o make the proceaa more c&fisfretej let us consider the fslliS^rlrtg example 
wtiith has been aolved by I^TUUEHt" 



(DIE PKDELEH TO BS SOLVED IS) 

tIF HIE N1.IH1SH11 OP CUSTHMEES TOH CRTS IS TVICE THE SQIMRE OF 

20 PER CEHT OF "EHE HUMHEE OF ^IJVERTISEMEMTS HE BUMSh AWD TUf, 

MIMBER OF AD¥EEriS13tEaTS ll£ SLlfifS IS 45,. WHAT IS THE HUHBEK 
OF arST0HEE5 TOM GETS Q.) 
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shown be Low are copies of actual printout from the STlffiKNT pro- 

I- 

^0in, IL lust rating stages In the trfln»£oniiatlan acid the aolution of the 
problem. The pjirenthecBB sib an artifact of the LISP prografffliilfL:^ l-an-- 
gu^ge, rind "Qr''^ is a Teplsceuient Car the question. ttiarV. tiiift avaitldtLc 
on the key punch . 

The flrftt stage In the tranaEonnation 1.b tc perfoZD all tbAihI*^ 
tory subat Itutlone . In this probLem only the Chirtfr pbrsCH underlined 

(,h7 the author » not the pragram) are 3uh3tltut<jirs ; "twice" beccrmed 
"2 time-a", "per cent" hecomea the single vord "percent"* and "square 
of" tl trunestad to "square". Having made theae aubst ttuttons , STUDENT 
print I : 



tWTTH MANBAT&Hlf SUES TITUTIOMS THH rUn^LEH IS) 
(IF THE NTMBEH OF CUSTOiMEKS TOM GE^S !$■ Z TIMES THE SQUARE 
?0 FERCEMT OF THE NUKBEB OF AWKRTl S fSIElf TS HE RUNS, AMD THE 
HOMBER OF ADVERTISEMENTS HE KUHS IS 45 1 m!AT IS THE STIMEER 
OF CUSTOMERS TXM GETS q.) 



Prom dictionary entries £or «ach WOtda the words In the pToblem 
tre tagged by their function in tertiW of the trans formait ion process* 
flnd STUDENT printa: 



(WITH WOElJS TAGGED lY FUNCTION THE PROEL^ IS) 

(IF 1H6 HUHEER [OF / OF) CUETOHEHS TOM (EJITS / VTIEB) I5 

2 (TEteS /OP 1) THE (SQUARE / OP 1) 20 CPERCKHT / O'P £)COFyOF) 

WE MllKBRB, (OF / OF) ADVERTISEMENTS <IIE / PSO) RUNS, AND THE 

HllMBEK (OF / OF) ADVERTISEMENTS (HE / FRO) RUNS IS 45, 

(WHAT / q«10HD) IS THE NUMBER (OP / OP) CUSTCMERS 

lOM (GSIB / ?ERE) (QMARK / DIM) J 
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If a word hflQ r tagi or tags , th-e word folltywed by "/"j followed ty 
ttie tdgs^ liiSC>^Ti«3 a single unit, end la ertclDsect in p a renClmS uS + SOmfi 
typiciil t^^glnes are shown above. "(OF/OP)*^ tndicaCEa that "OF" ts 
^n operator and other tafi^inEE ahow that 'V3ETS" is S vc-fbj "TlHBS" 
1b fln opetatOF of level 1 (operator le:vela ■will be explrJlncd bcLotj) > 
^'SQUARE" it an opeicatcif of level 1, "PEftCEaJP' is an cjpar?Jtor o£ Level 
2, "HE" is ii pronoun J "BBAT'' is a -q neat Ion wiord , and "^^tAEtK" (replac- 
ing Q,) is if rfELlmit«1f of a setiteuce. Tliese tagfied word a will pl^y 
the principal rolc in the reui^lnitiiS tranafoniiation to the sat of 
equations implicit in this problem statement p 

The next stage in the trinsfottnatlofl is to break the input sen- 
tences into "kernel a«nten-C*s"p As in the eitaEBple, a problem may 
be stated usin^ SBntenc*S <t£ great graiumfltieal comp 1 e^ It y ; however, 
the finiil Sta^c o£ the ttansfosi'fflatlon ta only de fined on a aet of 
kernel sentcnCSSi Itie aitdrpllflcation to kernel sentencea na done in 
STUDEKT depend* on the t<SCiitsivu lise of foitiLat matching- If an in- 
put sentence Is of the fofrt "IF' followed by a subetring, followed by 
a comma » a question word and a s^cofid eubatrins (i.e. it matchea the 
tfETEOR left half "(XF $ , (■?!/ qW>IlD( ?)" > then the fir at aubatrlng 
(between the IF and the conmia) is made an independent sentence, and 
everything following the eoram^ iS made into a second sentence. In 
the example f this means that the input ii reselved into the fol- 
lowing two sentences, (where ta.g;4 are Offlitted Cor the sake of brevity^ 



"The number of cu*toimi*S TOni get* is 1 times the 

square 2.0 percent of the number c»E ^■^Cv^rtis-entenCS 

he rmns J and the number of advert is ement* he funs 

is hSJ' and "What is the number of tusttunc f s Tim &et3?' 



This lifst procedure effectively reeolvea a problem into declara- 
tive assumptions and a que-SttOfl se^-te-ftce. A aecond comp lenity reeolved 
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by StUttEMT Is illustrated in the £irSC seritetice of thU g^ir. A co- 
ov(ftEtat« e&rttence conaiating of tfjfi ientencea Joined hj « conmia im- 
m^d lately followed by aci "and" (i.e., any SE^nt^i^te mat ch tug the 
HETEOR left half "($ , AWCt $>" ) wili be rea&lvijd into these tw in- 
dependent senCencea. Thfr firet senteDce above is therefore i:eacilv«4 
Into two simpler sefitencns- 

Ifsing these two irtvetsc syntdctic traii^f^fraationa ► this ptob- 
Icm Statement is resolvett Into " a imp L^'^ kernel scnCenicea . Fisr the 
eiiaiHple, BTLfDEWT print* 



(THE simple:: sentences are> 

(THE NUH&ER ((JF/OP) CUSTOMERS TOM (CET& / VERB) 13 
2 (TIMES /Of I) THE (SQUARE / OP 1) 20 (PERCENT / OP 2 J 
<0F / 0P> ms NUMBER (OF / OP) JiDVBHTISEMEITTS (HE / PfflO) 
RLTifS (PEftlOl] / DLM» 

(THE M UMBER <0F f 0P> ADVERTISEMENTS (HE / PRO) RUNS IS ^i5 
(PERIOD / ULK)) 

((WHAT / QWJJtU} is THE HtMEER (OF / 0F> CUSTOMERS Tt)M 
(GETS / VERB) (QWRK / DLH) ) 



Eiich simple sentence i^ e separate list f i.e,> is encloaed in paren- 
theses, and each ends with a d'SlicfiiCer {a ^trioJ or q4^e¥tlon mark)* 
Each of Cheae aentencea cdn now he traciBforaiB^ -directly to tte in^et- 
ptc:t4C£on in the cnodel. 
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p. From KfiTiiel Sentences to EquaCione,, 

The tranfiliorrnfltioii fron the sicnplc ktm^L sentencea to equ,?- 
tlona uses three level* * f precedence for eperatoifs . OpeiraCorB of 
hlgheT precedeTice l«V«l are uaed earlter in the tifanBfarEation^ B*- 
fore ucilizing ch* c^esfatora^ STDEEHT loots foif llneulBtic torms 
flHHociated vith tli& fefluality relation. These forflaa ine Ludei the CopuLa 
"ie" fltid tranaicivB vcrti In certain conteistft- Itt fhe eacarapLe we ara 
cotiiid&ringj mily th* copuLa "la" is used to 4ti*51cat(S equality. Tha 
uae of tranaitlve verbe ae indicators of e^Tiallty> tfiat is, as Tel#- 
tional linguistic fofm»> will be diecuiJcd In connection with another 
ejEample. When the rel^t ic^mal liBguistic form il identified, the 
names which are the arguments of the form are btolctfl dewn Into 
variables and opeTdtQfS ( f iific C lona 1 lingalsttt £of Pit) ■ In Che preaant 
problem, the tT»o n^mffiS ^P* those on either side 6£ Eh* '''±9" in each 
8*ntenjce . 

The word "la" may iilso bt used meaningfully within algebra 
BCcry pr«^leiffla aa an ainillaty verb (not nieaning equality) in such 
verbs 1 phraeta aa "ia multipLied by" or "ia divided by". A special 
Gheck Is mad^ for the octutrtflce of theae phr^ae* befoife pinoceedine 
00 to the main trans EonBflttOfl pif «3cedv.T e . The trinsfafiaaclcin of aen- 
tences tontaintng these special vtrbal phrases will b* dlacuEBtd later. 
If "ia" d04S not appear a an ayjiiliary in such * vcrhal phraaCf a 
sencetic* of the forin "Pi is F2" ia interpreted as indicating the 
equality of Che objei^tt named by phrases Fl aftd f 2 . No equifllity 
relation tflU be recogniKud within theae phrasfSj even if an appro- 
priate transitive verb oOOura within either of tb-em- If PI* anS 
PZ* represent the *rtthm*tic tranaformaCionl of PI and P2 , then "El 
is Pi" is transforms J ti^to the equation 

"< EQUAL PI* P2*>"* 



5B 



Ih« trans £oniiaEit>n of PI and P2 to glvt theui an Interpretation 

Cn the morfaL is perforaad r etu-rS Ive ly using, a pfjjgrani equivalent to 
the tabic in Figure U. Thtf table showa all th« operators aa^ for- 
qstl turrentLy Trecoaiiltcd by the aiUBEST pvci«T«rni. Hew o|ier5tora can 
easily be added to tTi« plf(&;fti'*in equivalent tjf this table. 

In perf-orming th« tlf*ns format Ion *f P phrase Ej a la ft to 
rll^t iearch is Bade f&r *ti operator at Itvftl 2 <±ndicflt*d by sub- 
eci-tpt« o£ "OP" and 2). If there is none, a left to right Search ie 
msde for a level 1 op-tfStor <iTOiicated hy iubscripta "OF" Snd L) , 
«nd finally anoth«tr left to right anarch it made for an operator of 
l-&vel (indicated by ^ subecript "OF" «nd no numerical iubacript)^ 
Ttie first operator found in this ordered search deterwines the first 
*tep in the tran^fptmatton of th& phrssc+ This operator and it a con?* 
text are trans formed «s indite ted in tolumi A in the table. If tw 
operator le present, delimiters an-i articles (a ^ an and the) are de- 
leted, and Cht |h!hr*fB is treated as an indiviaible entity, a variable ^ 

In tliB eKample, the first Blmple sentetite is 

(THE NUMBliR (of/of) COSTOMERS TtM (GETS /VERB) IS 

2 ilH^S/OT 1) THE < SQUARE /OF 1) 10 (PKRCElfT/OP 2) 
(OF/DF) the HHKBER (0F/0F> AUVERrrsigiEMTS 
Che/PBO> KUNB (PE£IOD/DIil)> 

TbU is of the fcraj "Pl is P2"> and is ttansfomied to fEQUAL PI* 52*) » 
Fl is "(THE WUH&ER (OF/OP) CUSTCMEdS TOM (GETS /VEIUi) > " * The occuv- 
rence of the verb "gets" la ignored because of the presence of the 
"Is" in the sentence > rucaning "equale'% The only opersCor found 
is "<OF/OP)". From the ti»ble we aee that if "OF" is Imaed lately pre- 
C«<!e'd by a number (not the word "numbfer") it ia treated as if it 
were the iftfilt "TIMES", In this ca^c, however, "OF" i« not prectdtd 
by a number; the tubicript OP, indicating that "OF" it *a operaCcrj is 
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fltrlppgd away, and the trans foifhia t ion proce*» it repeated oti the 
plii:a$c with "OF" ei& Longer acttttg as sn opera tpi-+ In this rep-eti- 
tiowt lifJ operators arc- found, and Fl* ia the varifltle 

CMiaCBEE OF CUSTOMEU^ TOM (GETS/VfiSB)) . 

Tiy the right of "IE" in tht Sentence Ijp P2 : 

(2 (T1ME&/0P L) THE fSQUARE/OP L) 20 (FEKflENT/Of 2) (OF/OP) 
tHE HTMEER COF/oP) ADVEETI££MEkT:5 {WE/FBD> RUWS (PE EIOD /DlJi) > 

The ftr^t operator foynd In P2: £b FERCEMT, an operatcir o£ level 
2* Frcjin the table in Figure 4* we see that this operator has the effect 
of dividing the numher InaBed lately preceding Ic by 100. Tbc "PERCENT" 
La reiM-ved and the tranif^rHiStion la repeated on the remaining phira.4i^. 
In the *KSmple, the "...30 fPERCEffT/OP 2> (OF /OP)..." tNCcomea 
"... .2000 (OF /OP)...., '\ 

ContLnuing the trJins forma tl<in, the operatc^rs found are. In 
ordtf, TIME5, SQUA^, OF ,-ind OP. Bach is handled a* Indicated ift 

tht t*bLer The "OF" in the contest "... .2000 <0F/OF) THE " 

is treated as an tnfiK TIMES> while at the other occurrence of "OF", 
the UpftrStor marklTif U removed, the resulting transformed espreS- 
El<nji for P2 la J 

(TIME 2 (EXFT (TIMES .2 (NUMBER OF ADVERT IS EMH^tJ? 
(HE/PiM)) WnS)) 2)) 

The transformation of the second sentence of the example is 
d<nri,e in p alTTiilar manner, and yields the equsCion: 

(EQUAL (NUMEEli QF ADVE31TI3 EVENTS CHE /PRO) RlCffS) i5> 
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The thtfd eentance Is of tha form 'Vhat la PIT". It st^lrts with 
3 question word and I9 therefoTe tTeatsd sfK^tilaLLy. A uniqus varisblt* 
a single word consJ-atlng of an X o£ G followfrd Ijy five Integers, 
Ifl created, *nd th* equation (EQUAL Xnnnnn FL*> ts stored . For this 
example, tKe variable XOOOOl wae creatt^lj and tbla; last eirapla sen- 
tence la transformed to th*; equation: 

<EqUAL XOOOOl (HTJKBEJl OF CUSTOMERS THM CGETS/VKRfi) > 

In addition, the tr^^ated vaifialJle ia placed OH the Hit ^>f vsriables 
for x?hlch S^nJDEHT it to find a value. Also, this variable is stored^ 
paired wltti El, the ualClf afiJi f <S if »e.d right side, for uSc itt ^f int ing out 
the answer r l£ a -valve I3 fii und for thia variable, STUDEITT [n^ints the 
sftBtence (El is value) witJl Cht appropriate sutatitution fsr value* 
Below we show tha full S4E <t£ e:quationaj and the p^inte^3 jSoluiticn given 
by S^^JDEHT for the eltflnirpl* belfig considered. For ease in so'lutlon> the 
last equatione created iffl put first in the iiB:t of equations + 



(TOE EQUATIONS TO BB SOLVED ARE) 

(EQVAL XOOOOl (NUMBER OF CUSTWKSS TXM (GETS /VERB) ) ) 

(EQUAL (NUMBER OF AD¥ERTI Sl&*EHTS (HE /FED) HUHS) 45 > 

(EQUAL (NLMBEE OF CUST0HER5 TOW (CKTS/VGftfi)) (TIMES 2 (EKFT 
(nraS .2000 (TIMBER OF ADVERnSKMJiNTS (ttE/PRO) RUBS)) 2>>> 

(THE iniMBEIi OF CUSTOHER& TW CSTS JS IM) 



In the eKaraple juet shown^ thft tq^iality relation waB inditste^ bjf the 
copula *'±b". In the pfpblifm atiown below, solved by STUDENT j tqi^allty 
is indicated by the sttyrtence of a trans it ive vetb Iti the proper context 
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<T1IK fROBLEH m BE SOLVEI> IE) 

(TOH HAS TWICE AS MANY FISH AS MARY HAS GUPPIES ► IF MAHY HAS 

3 <iWFlE^t WHAT IS IHE NUMBER ClF FlSfl TDM HA& qO 

(TIEE etJUATlONS. 10 BE SOLVED AEE) 

(EjQUJAL KOOOOl (NUMBER OF FISH IDM (HAS/VKRB))) 

(apftTAL CmMEER OF GUPPIES (MART /FERSOPI) (HASyVEEB)) 3> 

CEQUAL (number of fish TCiM (FCAS/VEEB)) (TIMES 2 (NtWEEIl OF 
GUTFIES (MARY/PERSOM) ( HA$ A^ l^iiU) ) ) > 

(tHE NUMBER OF FISH TOM HAS IS 6) 



The verlj ±n thU case ts "hae". Th& ilm^le sentence ^^Mary haa 5 

gupplea" i» trans formed to the "equivalent" Sentence "Tlie number of 
gupples Mary has ts 3" and the processing of thia latter a«v)t«ilO« ii 

dam as ptcvltjusly diflcuflsed. 

Hi* gerwi'^l format for thlg; typ« o£ Ecntence, and the foimaC 
pf th* ItitotmBJiflte sentence to which iC i« CrSna formed I5 best CK- 
prea:3cd by tUlft foUo-wing METEOK rule; 

(^ (5(31/VEHB)(^L/8tWE£R) ^> (THE NUMBER OF 4 I 2 1£ 3) *} 

Thia rtile frtay bfl Tflild : any thing (a Subject) followed by a vtrb foL- 
loHsd by a HMtnTji^i: £dllc!ved by anything (the unit) ia tranafofcnfed to 
a aentenc* etaT?tJ,n;g with "THE HUHSER OF" foLLowesI by the vnit, fol- 
lawed by the aybj^ct and ChE verb, Ealltfi^td by "IS^' and theti the 
number. In "U^ry tia« 3 guppiea^" the: eubjett ts "Mary"", the verb ^'has", 
and the unit a "guppUs". Simi laxly, the a^ntunCe "The wttchea of 
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FirCli liir^fH 3 Tn^glc patlcna" vouLd be CTaiiB formed to 

"Tb« pumb^r' of mflsle pot tone Che wltcha'a of Ftrth brew le 3. 

In ^1 d d i i i [?n Co a declaration of n.iiiiib&r , a a IngLc object tran- 
aitive verb mHy be uStd In ft cmnparattve sCructurej, such as eiihlbited 
in the aentence "TDin tiSa twice as many fish ag Mary ha* gupplea.'''' 
The HETEOR rule which glvea the effective E r atia forma t Ion for this 
type of aentence structure isi 

<* <$ (^lA'EM) ^ A& MANY ? AS ? {^1/V^tt^} $> 

<THE IfUfflJER OF 6 1 2 1$ J Tl[E DUMBER OF lEJ fi 1?) *) 

For the example, the trSnafoniied SenturiC^ ts: 

"The number of fish Icon has Is t*rici the oumbar of gupples 
Hary has" 

Trana format ion of new sentence fornbats to f&rmata prevloualy 
"uiuderECood^' by the program cirn be eftally added ti> the program^ thus 
eKC ending the subset of English "underctaocl" by StUlSi'3lT- Xn the pro- 
««43iti^ that actually takea place within STUIHEHr the Intermediate 
a«titeirtcea ehovn never exiat. It wag easier t« g.a directly to the 
dhj^qI f^OdD th& format, utilialng aubroutlae^ ptevi^usly d^fin$d In 
tETnS of the- setfiantica of the model. 

The wofd "is" indicates equality only if it is not uScsd ag 
an auKitiaxy- Hie eHoaple below ahows how ■viErbal phrases containina 
"is", B-iiih a* "la BulCiplied by", and "is incTe^tfc^l by" are handled 
in the tranaforraation. 
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(TlTE PFIOEIEM TO BE SOLVED IS) 

(A HmHER IS MULrri^LtKn by 6. THIS PRODUCT IS INCREASED BY 44. 

THIS HE5ULT IS 6S . FtHD THE NIIH&KR. .) 

(THE E^JUATIOHS TO 16 SOLVED ARE) 

C EQUAL XOOOOl (WIIKBKR)) 

(ECJUAL (PLUS (TIMES tHUMBER) 6) 44) ^8> 

(ffiCE aUHBER IS ^^) 

The aent«nc« "A numbfir ie multiplied by 6" only indicates that 
two objects in the model aife rv!;tte<d miLtipLltit tvely, Btwl doea not 
indicace eaplititly any equaLtty relaEioti. The Lnterpretaeion of 

this sentence Itl the model i& the prefiK moCation product; 

(TIMES (HUMBER) 6) 

This laCCfst phrSEe ia atored tn i tsa/parsxy location for posBible 
Later reference. In this prebltm, it is if«f^r^nced in tbc nestt aen- 
tsTice. with the phrase "tnis PRODUCrV. Tit* important word In this last 
phraEe ia "THIS" — STUDENT Ignoi-us tjH other werds in a v^ri^ble' con- 
taining the key w&rd "THIS". ISi* last teBporarjly atored phrase is 
^uh^tituted fot Che phraae containing '"miS". Thus, the first thr«« 
sennit encea in the pCdblem ahovn above yield only one equation, after 
tw- Eiibatitutiene for "this" phrases. The last sentence "Find Clie 
QumbeT." is trartsfprmed as if it w^re '*HhaE %S the number Q*"., 
and yields the tittt e^q nation shewn* 

The word "thi»" Tiu^y occur in ^ concept where it is 0ot 
referring to a previotiBly atored phrase. Eelow is an ejiaiiip'le of 
?uch & context . 
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tTT[E FROBLilM TO 'Bti SflLVED- IS) 

(THE PRICE or A RAlHO IS (J9 . 70 DOLLABS . IF THIS PRICE IS 

L5 PERCERI LESS THAK THE MAKKfiJl PRICE ► FIND THE MaEEEB PRICE.) 



<THE EQITATIOHS TO BE SOLVEI> ARE) 

(ECJUAL KOOOOl (HARKED PRICE)) 

(KltTAL (PRICE OF RADIO) (TIMES *S499- (MARKEJJ PRICE))) 

(eqyAl, (PRICE OF RADIO) (TIHES &&*70 (DOLLARS))) 

(THE HAUCED PRICE IS £2 DOLUilS) 

In auch conCeKts, ttie ptir^S'^ contatnitig, 'THXS" la replaced by the Left 
half of the laeC e-quStion crefltcrfn In this eJtafnpLe, iS-tlfOENT brftalta 
tli« lasC Eecitence into two atinple senCente-S * deleting tilt "IT"- 'Iben 
th* phTflR£ "^IS PRICE" la replaced by the v*r table "PRICE OF RADIO"* 
vhlfh is the. left half of the prevlouB eqiiiitioai 

Thli problem llluatrates two other features □£ the STISDEHI pro- 
gr^artt Th^ first ia the action of ths c^Jinple^ opeT^tor "perE^ent leiH 
thsn^t It cauaea the number ImmedititeL^ preceding; it, l+e+ji 15* 
tt> be sijbti^acted Crom 100, thl& xaault divided by 100, Co Jive *&^ 
(print e4 ^a «S499 du« ta a Tounding, error in floating point couverelon) . 
Then, this ciperator becontea the InfLx operator "TIME3"<. This is in- 
dicated in the tabl« itl flaure^ . 

This pro bl gen also illaatratea how unit a anch aa "do Liars" are 
hanjdied by tilt STTrtJ^S^ program. Any vord which inined iate ly folloKa a 
gjumbei is l^bt'-lod as a ap^ciaL type of variabLe c^LLed a unit . A 
mmbET fO'llowe4 by a. unit la treated in the equation as a pTo^nct of 
the nujmbci' artd the unit, e,E.,%$.?0 DOLLARS" betuuvc^ "(TtHES 
69*70 { EWLLAKS) > " . Units ara tre^Cenl as spcciiaL i^airiabl«9 in solvins 
tht 8-et of equations H a unit Tiiay appear in the aaawer thougji other 
variables eaniioti If the value for a variabLe found by the aolver is 
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Cht ^ri^Mct at a nunber and a unit , STUD'EtilT concF^Cen^tes the iiJumt>ET 
ttid tb« unit. Fq-t eKBBpIe> the aoiutlon for "(MARKED FBICE)^* in 
Che pi-ei^l«(ii a^v^ was (TB4ES SZ (DOLLASS)) and STOIDEKT printed out: 

(IH£ MARKED PRICE IS 82 DOLLARS) 

Ilt«if« la atl ^iXceptlQn to the face that any unit may appear In 

ttiv dTi^ueif^ as llluaCrated in the problaEn btlo^. 



CTHE FmmLEH TO IE SOLTCU IS) 

CtF 1 SPAN EqUALS 9 INCHES, AHP 1 fATHCW EQUALS 6 FEET, 

HJOtf KAHY SPANS EQUALS I FATOOH Q.) 



(THE EQUATKJflS TO BE SQLVmi AHE) 

tKQUAL XOOOOL (tIKRS 1(FATH0HS))) 

UQUAL (T'EHBS 1(fAT(I0MS)) (T2iE& 6 <fE6t)>) 

(EQUAL (TIMES 1 (SPANS)) (TlHES 9 (INCHES))) 

IDE EQUATtOB^ WKHE IHSUFFICIEHT TD rtBD A SOLUnOW 



(USIJC WB TOLLOttlHC KNOWN DELATIONS HIP&> 

((SqUAL C TIMES I (YAEPS)) (TIKES 3 (FEET))} (EQUAL (TIMES 1 

(FEET)) (TlMBS IZ CINCHES)))) 



(1 FATHOM is S SPANS) 

l£ tha unit <?£ Che answer it ApecifiedK in this praiLem by the phi^agit 
"haw many ^paus " -^ then only thiC unit, in thia profcLem "spans", 
Tit#y .-i-p-pir-iT in the anf^tr^ Without this restrict Ion > STUVENT vould 
blithsly «n«wer thic i>]:Dbleiii with "( I FAIHQM LS 1 FAIMM)". 

In th« C r«ns Fi^miii t ion froin the English sCateTnent of. the f rob 1 pen 
to the cqu^ clans, "9 LHC:HES" bccams (TIMES 9 ( HECHES) >. Ucuever , 
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"L fATHCW" becamje "(TIMES L (FATHOMS))", Thfl pUrsl fotm fpr fithom 
has beea used Instead of the eingul^t iatm, STWOEI^T al^^y^ »9a tlfl« 
plvraL foum if known, to ensure that all utilte afipesr in uflLy one 
fertn^ Sirtce ^'fathom" anfl "fathoTBs" iSre Njiffctetits if both ver« waed 
ST13DEHT vould treat them as distinct, utir*lat&d urtits^ Tfse plural 
form is part of the global infomtatiort ChaC efli> b« tnade available 
te &TUJJEHTi and the pLuraL form. o£ ^ i^i4 is ai,Lbfit itnted for atty 
BirtguLar form appear in£ afCer "1'^ in «ny ^liJ^ase- Thn? inverse opera- 
tion is carried out for CortcCt f^irltlC^UC Qi the solution. 

Hot ice that the inf onng t ion given in ttie problea was itiauf fieient 
te allow ao lilt ion *f the set of squat iuos to be solved. Therefore^ 
STUlHSflr looked in its glossary f&? information concerning e-acti of the 
unit* In thiE eec of e^guatlonPt It feufld th* relationebipj "L root 
equ«l« 12 Inch e 3." aiid "1 y?ird equsl* 3 f«*t." Uiing only the first 
£*eej and the equation it implies j. STUPEHT ia Chen able to solve the 
problMi* XhUB, in certain cSSet Where a problem la not analytic , 
irt thft senee that it does not contains eKplicltly stated ^ *11 the 
infofiD^tlon needed Eor its solutions STUDENT ia able to draw on a 
body of t^Ctit picking out relevant Onft* , aftd une them to obtain a 
aolution . 

In cert Jl in probletasj tbe trEinafomiiit ion proCeJS 4vtS not yield 
a set of aolvUblE] e^naClona. However > within thlt ««C of equations 
there exiaCs ^ pair &f variablaa ^or more than on* pair) gtfth that 
the two variables aft only "slightly iif fcr-cnt", and really name the 
aame object in th* riwdel. Vhen a set o£ ^n^uat'^ons is wftsolvable^ 
S'HJDENT se^rtheg for relevant global cf^Tjatlarts^ In add it ion ^ it 
uses several heulftatlc techniques for i^&ntifyins two "slightly 
diffeircnC" wariablea in the equstione. The problem below illustrittos 
the identification of two variablea where in one variable a pron^iun 
haa been aubatituted lor a noun phrase in the other variable ■ Hila 



m 



ikJeTitlficat ton. ts made by checking all variables appearing be f o r e one 
contdinlns tbe proiuDun, and finding ane whicb 1$ identical to Chia 
pronotin phraae , wtth a BubaCltuttotl of a string of any lenfiCh for 
the pronoun. 



(THE PRJOELEM TO ftE SOLVeU IS) 

(■niK HllMfiKB Of SOLOlERS TKE RUSSIAKS HAVE I£ OME HAW OF THU 
NtMEEP OF CUJ^S THEY HAVE * THE KUMBER OF GUNS THEV HAVE T£ 
7O&0 . WHAT IS niE HatBEE, OF SOLDIEKE THEX HAVE Q.) 



(Tlffi EQUATIONS TO- BE SOLVED AKE> 

tEq4JAL KOOOOi] (HUMBER OF SOLDTEHS (THEY/^ED) <RaVE A'ERB) > > 

(equal (NUHBER Of ClHTS(THEY/FaO) (HAVE/VEEB)) 7000) 

(EQUAL (HUHBER OF SOLDIERS RUSSIANS (HAVE/VEEB)) <IIMES .5000 
(TJUMBEB of CUHS (THET/PRO) (HAVEA'EEB)))) 

THE EQUATIONS WERE IHSUFFICIENT TO FIHD A SOLUTIflll 



(AS.^UMIMC THAT) 

((tlUHftKH QfT lilOLfill^ltSf (they /fro > (HAVE /VERS)) IS EQUAl TD 

(NUMBRR OF SOL&IERS RUSSlAHS (HAVE/VEPB))) 



(THE miMEER OF SOLDIEI^ THEY HAVE LS 3500) 



If two variables taatch in thts fashion, STUDMT siBEuniea the two 
variables ^tt; frqual^ prints out « StdCesKinC of this a a s uinpt ion , as 
ahown, and adds ati equatiisn «Kf]r«it«iTig thifs equality to the set 
to be solved. T>ie Sylutipn protectura is tried again, with this 
additional e^Liation, In the ejiarnfilcj the a<J<llcional equation was 
Bufflclent t& aLlo\^ determination ef the solution* 
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Ih« «KaDi!pl£ belov ie again ? "non-afia lytic" pr^bl^si. The first 

»et of c^u-ati^ina developed by ^TUDEHT Is uii«cilv«1rl«» ^«r«Ep]f«^ 
STUDENT cries to find some ralavant eciuations in its SCOtC of glQ- 

btl information- 



(THE PROBLEM TO BK SOLVED IS) 

(THE GAS CONSUHPTIO'N OV K¥ CAE. IS 15 MILES PER GALLON r 
THE DISTANCE BETWCEK ISOSItHJ AMD- HEH YORK IS 250 MILES. 
WHAT IS Tl^E HVHJltK OF OALLOHS OF GAS USED ON A TRIP 
BETWBHS N£W TORK AJfD j&OSffiiH Q.) 



(THE EQUATIONS TO JiK S{)l,\fED ARE) 

(EQUAL KOOOOl (NtJMEEH OF tiALLOSrS OF GAS USED OS TRIP 
BETWEEN NEW YORK AKE" BOSTON)) 

(EQUAL (DISTANCE BETWEEN BOSTON AND NEH VIDRK) (TIMES 
550(MILES))) 

(EQUAL CGAS COKSLlMPn™ OF Wi []All) (qiJOTIENT (TIMES 
15(^1LE3)) (TIMES 1 CGALLONSJJ)) 



THE EQUATIONS WERE ENSUFF IC lElTT IS> FIMC A SOLUTION 



(US INC THE FOLLOVIKG KNOWN RELATIONSHIPS) 

((Bl-QUAIL. (DISTANCE) (TIMES (SPEED) (TlMfi))) CSqptL (DISTAHCE) 

(TIMES- (CAS- OOUSUMPTION) (mJHEER OF GALLONS OF GAS USED)))) 

(A!?f;UHlNC That) 

(CDI5TANt;E) IS EQUAL TO (DlETAlfCE BETWEEN BOSTON AHD H^ 

YOEIK)) 

(AS&UMINC THAT) 

(<GA& OnSSUMFtlON) IS EQUAL TO <GAS OOHSUHPTlOS OF HT CAR)) 

(ASSUMING THAT) 

((NIMBER OF GALLONS OF GAS USED) IS EQUAL TO (HUMiBPR OF 

GALLONS OF GAS USKi] ON TRIP BETWEEN HEW YORK AND EOSTOH) ) 



(TUB HUH^BE OF fiALLOMS OF GAS USED OH A TRIP fl^lWtEM 
mm VORK AMD BOSTON IS 1&.*6 GALLONS) 



It uses the first word o£ each variable string 0s * Vfly tft iCi 
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gloEBary. Th? one exception to this t\il^ it th^t Ihe words "number 

of" are ignored if they are the first two words of a variable string. 
^UQ> in this problem, STUDENT retrieved equ^tioiiB which vere stored 
under the key words diBCance, gallons , gaa . and nailea . Two facts 
about distance had beEH stored erirLier, "distancE equala speed times 
time" ^nd "distdnce equals g;is CDnsumptlon times number of ga Lions 
of gas used". Hie equaticfna IjnpLicit in th^BC Bent&Ac^a w^v^ sti^ved 
and tetrieved now ^ as poaaibly usefui for the solution of this 
problem. In fact, only Che second is relevant. 

before any attempt is made to solve this augmented s^t of 
equationB, the variables in the augmented set are mstched^ to id^nti' 
fy "altgjitly different" variables uhich refer to the g-amc ^bj^Ct in 
tbe madel. In thie atsinple "(DISTANCE)", "(GAS OOKSUHPTTON)" ind 
"CHTJIMJKH Ob' GALLOWS Of CAS USED)", are all l^^ntLfied with " similar" 
variables. I^e fplLiTwing conditions muat b& sstisflftd iitt thle type 
of identification of variables Pl and f2: 

1) PI must appear later in the prisblcru thin P2 * 

2} Fl is completely contained in F2 in th« ««nie that Fl 
is a contiguouB aubstring uithin F2. 

This identification refl^tts a eyntact^f phcnomcnrm where a 
truncated ptirase , with one or cnore mi:;?difyin^ phrases drc^ppc^, is 
often, used in plaoe of the original phrase. Far exampli^j if ttie ptir-)e« 
"the lengtb of a TecEangle"' has occurred j th« phrase "the Length" 
may be used to mean Che aatne thing, this tvpfl of idenci ficstion is 
distinct fTom that rasde using prpncun su1)Stitution< 

In th« eKan^lfr ahovet a istdtfd ^ctiemtl was used by identifying 
the variables in the echtma with th* variablas that occur in the prub- 
IttBk Ihia problem is solvable because the k-e^ phrase:! "distomce", 
V)^d3 ^onsunnpCion" ^nd "number of gallons of gas us&d" occur as 
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SubEtringE of Che v3Ti3l>leE in the prabLem. Slnc« SruX>ET]T idenEl- 
fisB each generic: key phreBe of the ech€£mj ulth r partif^il^t v^ti- 
dble of the pToblefm, any EchaniB cea be used only once In ^ prdbLgnt 
Because STUDENT handles scheme in this sd Hoc fsthiOA It cannot 
solve pifoblems in which a ir^latidfi^hip iuch at "dlitdnce equiPls 
speed times time" is needed fov tVC differttiC v»Iiiea of distance, 
speed, and time. 



E. Pnaaible Idiomatic Substitutions. 

There ;<re a acne phr^iaea vhlch h^ve a dual character^ depending 
on the concede. In Che example below ^ tbe pbrase "perimeter of a 
rectangle^' becomiea a variable with no reference to its meaning, <tT 
definition, in temiE of the length and width of the rectangle. 
This definition ia uaneeded for aoluEion. 



(THE FKDBLEM TO BE SOLVED IS) 

(THE SUM OF THE PESIMErER OF A RECTANGLE AHD DIE PESIMETiE 

OF A TRIMGLE IS 24 INCHES. IF ^E PERIHETEft OF THE SECXANGIE 

IS TWICE THE PEBIHETEa OF THE TRIANGLE, WEIAT 13 HUE 

PERIMETER OF THE TRIAKGLE Q.) 



(THE EQITATIOHS TO BE SOLVED ARE) 

(EQUAL KOOOOl (PERUfETER OF TRIANGLE) ) 

(EQUAL (PERIMETER OF RECTAHCLE) (T^ffiS 2 (PERJKETeil OF 
TRfeEfGLE))) 

(EQUAL (PLUS (PERIHEt'ER OF RECTAMGiLE) (PhKLM>;T>;K qV TElrtHCLS)) 
(TIMES 24 (INCHES))) 

(fHS FKHmii-EiiR Q1-' miK I%Ul!TGLE IS 3 UCBES) 



llcit'«v«ir» th« £ollOftlTl^ ^1^c>bl«m £,3 4tde«>J in tstm^ of the perl- 
ra^t^f p I^Dfitb aod widtti of tb$ rectangle, ri^anj^fciriaitig the ^la^llgh ifltO 
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(THE FflCBLLH TD SE S-DLVCQ I S> 

(THE LEPiGTH DF k RECTABtGLE IS. B tNCMtj fX>R% THAN THE 'iMIDTht 

OF THg HECTAN6LE . ONE H*LF QF THE P-emHeT^Fl Of THE flECTANGL-E 

1$ II mCHts ^ fma the length and tke. hipth «f the hectangle 
■ ) 

{THE f^U^Tl^NS V3 il BDLVED ARE) 

(EQMAI. GfliSl* (Hft&rK «f RECTAMGLEJJ 

{EQUAL GCZS15 C LENGTHS 

fldVAL {TlJ^E^ .MiyQ {PERIMETER QF RCCTANGl^)) (TJMtS It <]NCHE$|}} 

(^^VAL {LENGTH OF AECTANCLE) (PLU4 (TIMES < (IHCH-ES)) EMt&TW 
OF RECTANGLE})) 

HHE E{3UATIiiNS biERE INSUFflCIIHT TO FFHO A SOLUTiOH 

(USIIHS THE lrOLL0iNINi; KNOflN AEIAT I DNSHI P^) 
(tiduAL (TIHEJ 1 (rgETJJ £TTHE5 1! (INCHESJ)}! 

(ASSUJUINS THAT) 

{(iEN<iTH> IS i^UAL TO CLEHGTH of nECTAN^lC)} 

THE EdUATIOlHS IJEIIE I hISOFF | C | ENT TO FJND A SOLUTION 

Tfi*lNEl POSSIBLE P PI 0*15 

(TME ?JH)BLEM NITH AN ttHflHAIIC SUBSTUTIOV IS? 
(THE LENGTH OF A RECTANftLf i$ $ INCHES. MORE THAN THE Wl QTH 
OF THE RECTANGLE . ONE HALF QF IWltt THE SUM OF T14E LCNliTH 
AnO WlOTH Of THE HECTANCLE IS It INtHC& . f J ND THE LENGTH ANQ 
THE WIDTH OF IhE aECTAN<»L£ .} 



ITMC EaiMTIONS Tfl *E SOLVE& AAE) 

{EqUAL Q0151» {HIDTH OF ftECT^hN^LElJ 

{EClU^L CaZ517 {LENCTHn 

ttOUAL [TIWES (TIMES »SaJ)(|i S5 (PLUS {LENGTH) {btlDTH OF RECTANGLCllJ 
{TIMES IS ( INCHES})) 

{EaVAL (LlNi^TH ttf HECTANGLEj (FUUS {TIMC& 1 {|HCHESl)< (HIOTK 

OF flECTANfilE})) 

THE EHb^AT^ONS NEfil LNSUFF ICIEHT TO F I NO A S0L4JTJDH 

[OSlNXi T1HE FQUQWINC KhQWH AELATIONSH IPS ? 
{(EQUAL (TIMES 1 {FEET J) {TIME? IZ (1NCHE&}))) 

(AS&UHING Tl^AT) 

(CiENGtHJ E£ EQUAL W ( LENOTH QF ll£cTANdL£]} 

(THE LEHdTH IS li FNCHES) 

CTME -rilOTH «F THE RECTANGLE IS S INCHES] 
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flquatione Is iwjt cufflclent iov solM-tlon. Ifelttier retrieving and ua- 
iiig an equStion abt^tit "lrtch«Jj'\ Ch* tifilt in the problem, aor identi- 
fying "'lETietln." with # It^rtS'fit ^ht^^f a*CVt to mak* the problem aoL- 
vable+ Thefef&rc, STUDENT ISdJiJ in Its dictionary of possible idioms, 
and finde one which it C*n tty ill th* p.r'Pblrni+ STUUEHT actually 
had two passible idiomatic fuhetituCioTiiS Vti Ictt tt Cdul4 h^v6 tiAdc 
for "perijneter ot i rectangle ": otte wne in t*nn« of ttl6 length and 
width of the rectimgle Diid th-a other W9I in tenas of th& ehorteait atud 
longeat aidea of the rect^mgls. Whsn thete dt€ two p<ftti-b\i: SubstLEu- 
tlona for a gi^en phrase, one ie trie^ f irftt » n4[w?Ly tht c^nc STUDENT 
has been told about ntost re^ently^ In this ^FOblCMi the &orr«Ct &nt 
was fortunately first. If the other h#d been firsts the teviiw^ 
prc^bl^m uould not have been any inore solvable ch^n the original, 
and eventuality the ae^^nd (carrect) substitution wciuld hove 
been Tna<J« f Ofily Ofie tton-nandatary Idloousatic substitution If ever 
nti^de at 0^4 tiiU^! although the substitution ia made for dll occur- 
renices c£ th« phras* dlOStn* 

In thi« problHn^ tht Idiomatic substitution made allows the 
problem to be solv^^ sftcr Identification of the variables "length" 
atld "length of reot;in;le"- Ihc tctJflevftd equation about inches vs9 
not needed^ However > Its preaenct in the set of equjltions to be 
eolved did not «i.d«tirack the solver in any way. 

^is use of poffiblE^, tu-t n^^i^-tfiandatory idiomatic suhEti tut ions 
can also be used to give; STIFPF3TT a uay to solve problems in which two 
phrases denoting otifr particular variable are quite different. For 
mample, the phraEe, "students whci paa«&d the admisstonE test" and 
"*u.eeeesful candidates" ml$ht be deacrlbing the aarae set of people- 
However, Bint* STUDENT kflows nothing of the "real world '^' and its 
value AyttiM f^r 3ij.c{:esB, it trould never Identify thefee two phr^^«$ ^ 
However^ if told thjt "BuocessfuL t*ndidate*" 4o«*etiiiie atana "atcudefita 
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vho ^fiffe4 the AdmissiDnie test", It vauld be able td Solve t) probLem 
uSlTtg thcHit two phrases to identify the asme vartdble. Hiub , pas^ 
alltlet Idlcnndtlc: Suli^t Itutions serve the duel ptirp$s« of pTovid'ing ten- 
C^Clv? subatltLLtlons of def initio cib , end IdtnC Lf Lcdtion of synQLio-r 
1IIDU5 phrases. 



F* Special HeuriBticB. 

nie methads thus far dletussiGd h^rve heen jp-p-lleable to the 
entire range of algebra problcTni* However, far apetial t Liases of 
problem^, addlcien^l heutistics mil y be used vfiin^lh sr? needed for 
aeiitbtfvf of the cl«s«, but net applicable te Othvf probleniB. An 
e5tample is the cims of ase problsuBj a a typified hy the probletn 
belou ,. 



(THE PROBLEM TO EE SOLVED IS> 

(BILL S FATHER S UNCLE IS TWICE AS OLB AS BIU, $ FATHER. 2 
YEARS FROM IfCV BILL S FAtHEa WILL IE i TIMES A3 OLD AS BILL* 
THE SUH OF mEIR AGES IS ^2 . FIND BILL S A^JE .> 

(THE EQUATIQSJ3 TO EE SOLVEC ARE) 

(EQUAL XOOOOl ((Mhh / PERSON) S AGE J) 

(EQUAL (PLU^ ((BILL / PERSON > S (FATHER / PERSOITi S (UMCLE 
/ PERSON) £ AOE) (ITUS {(BILL / PEESOS) $ (FATHER / PERSON) 
S AGE) ((BILL / PER.SQK) S AGE)» n) 

(EQUAL (PUIS ((BILL / PEHSOS) S (FAlT-LEM / PERSON) S AflE> 2> 
(TIMES J (PLUS ((BILL / PER50H) 3 AGE>)> 



(BILL 3 AGE IS 3) 

HefotE the age prablgiffi hcurlstic-c are uaed^ a prt>bLeni imust b^ 
Identified as belonfiinE to- that cLaas of pif obi ecus, STETDEST tdenttfia* 
age problemfl by any oc c nsf t trt cb of one of the £e I loving phraaeaj, "a* old 
as", "ye.-JTs oLd" *nd "jjge^^ Thl* identification is ciadf inmed lately 
afcer all uorda are lo^k^d: up in the dlttipnary and Caeged hy function. 
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AftQf th* ipcci^l h^uriBtlcB aire UBsd the modified problem ia trans- 
fonned Co BijuStiotii ai iJeecrllieid previoLtsLy. 

The need for epeetfll in*th-odfl for age prolileiaa arlBSE because 
of the conventions ubc4 (<ft itsttirtin^ ttie vaif tables, all of which are 
ages. The word sige Is usually nUC wfted explicitly, but la Implicit 
In such phrdses iis "^^ k?1^ a$". people's nanfcea ere uaed where their 
ages are really the implltU vartablea. In. the example ,. for Instance, 
the phrase "Bill's ftfthctU uficle" Is used InaEead of the phr»se 
"Etll^s father^ s uncle ^s *ia"* 

STUDEEJT uses a special h«ufi4tlc Co wsk^ all these ages c>[' 
f)l£«£tv It? do this, it muBt 1(iiow vtilch W&rds are "person words" and 
thetiE!£ore» aay be asaociated wicJi an age* Ft>r tl'*ls problem STUOEMT 
h«« lM«n teld that Bill , f£ChEr, utwJ itn&lfe are person words. They 
can be Sv^ti t^^^^d a@ such in the equiitlErnfi« The '"' " f^l lowing a 
word 1e the STVD^t rep resent a tlDit for pOtSCSSivCf used instead of 
"apostrophe - s" foi? prograeaning convenience » STnUENT inaerta s 
"B. AGE" after every person word not foLliswed: by e "S'^ (he c suae this 
"S" indictees that tbe person-word, is beio^ u£«4 itl s p^sae^selve 
sense, noC at an Independent age v«rl{ible) b Tfiutj se lndlc:sted^ 
the pbiras« "SItL S FATHER S UNCLE" b«C«[i«es "BILL S FAIHER B UHCLE S 
AGE". 

In aLtditi^rt to changing. phraseE nSmifig people to ones naming 
agea, STDEEHT makes certain Bpeci.-^l iiJiwtiaCl& aitbst Ittitions . For 
the pl^i^^^^ '^tJielif ^E'Sb", STUflEHT substlttitefl a conjunction of all 
the age vari-fiblts ettc<Hi-nteEed in the probleTii+ Irt th* enaaiple, for 
'■THEIR AGES" 5TUBEHT aubatltutes "BILL S FATHER S UMCLE S AOE AHD 
BILL S rAlTIEft S AGE AlfD BILL & AQt" . The phrasea "as old ^s" and 
■'years old" ore then deleted as duTraiiy phrases not having atiy neatiing, 
and "will be" inud 'Va*" are changed to "ls"r There is tio need to 
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piC444tvc Che tccisc of the coptiLSf sine* the Scr e of the EtttLire or 
paaE tttifle Is preserved in auch pre flu phrage-s as "2 yeara from now'', 
or "3 y*flf8 ago", 

Ttit refnaining HpeciaL age problem h«Tj.rlstltB Sre used to process 
the phcas&s "trt 2 yea re", "5 years ago'' and "riov"^ The phraae "2 
years front tww" j.« trana formed to "In 2 years" before proeesslng. 
These three Etaifr phrases may occur tTnried lately after the word "age", 
(e.g., "BllTs age 3 years ?go"> or at the b^^iflfilng of the aectence . 
If a tirtte phtasc occurs 4Jt the beginning of the ietiteTice, it iitip Lic- 
it Ly [B£idifi4$ all fl^pi mentioned in the senteace, eK C &pt tbose 
fallowed by th^ir own time phraser For eKaaple^ *'tn £ yeSrs Bill's 
father 'j a^fl will be 3 timea BiLL^s age^' ia «q|uivalenC to "bill's 
father ^« 4^-e In 2 yeara Hill be 3 times Kill'* age in 2 yeara'^ How- 
evelft "J yesrs ago Mary' a age uaa 2 time* flml^* age now" la equivalent 
to *'Hapy*s a^e 5 years ago waa 2 times jUnn.'' a a^e new", Thiia prefi^c 
ILmti phraje« 4 re handled by diatrib^tin^ th^rm cjv^c <tll sgea not 
TTTodiflcii by Another time phraae. 

After these prefix phragte hava been diitributed , each time 
phrase is tranalated appropr lately* The phrase '^in 5 yeara^'" eaijscS 
5 to be added to the age it fftll&ws, *nd "? years ago" causes 7 
to be subtracted froai the a^t preceding this phrase, tlie v^X^ "now" 
is deleted. 

Only the special heuxistits described thus far were tLeeessary to 
solve the first age probleHi* The second age problem, glv^sti below, 
requires one additional heuristic not previoualy mentioned. This 
1» a substitution for the phrase; "was when" which effectively die- 
couples the two facta couibliie<3 In the firat aentence. Fisr "w*s 
when*V» STUDEtaT substitutes "was K ycaj-j agp * K years ago" where 
K is a new vflrtablc created for this purpijaif. 
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(THE PftOSLEM TO BE SOLVED IS> 

(mKY l^ IVICE AS OLD AS ANN VAS ^Eli' HARY WAS AS OLD AS ANN 

IS KJW . IF MAKY IS Z4 ¥EARS 01J>, HOW OLH IS AHN Q.) 



tTHE EQUATIOaiS TO BE SOLVED AHE> 

tEqUAL XOfHHJS ((AHH / PERSON) S ACE)> 

C EQUAL (( MAfiY / PERSON) S AGE) 24) 

(EQML CPLU^ ((MARY / PERSON) S AGE> (MtHUS (XDDOOJ))) ((ASH 
/ PERSOU) S AGE» 

(EQUAL ((HAHS / PERSOH) S AGE) (TIMES 2 (PLUS ((ANN / PERSON) 
S A(3E) (MIKUS (K[}CH>07>))>) 

(A-m S AGE IS Ifi) 

In thit e^afflpLe^ thie first sentence bec-oinei the tVO aenteficesr 
'"Hary Is tvi-ee as oLd as Ann X[>[>t)[>7 years ago* 3D00O07 yea re ago 
filary was as old sa Ann ts now." These two otturt^ertcea of time 
phr^ie^S are handL^ as discussed pTeviouely^ SimiLarLy thi {ihrese 
"will b* vlietl" weuld he trans fornied to "in K veSrs ► In K y^ata". 

These decoupling heuristics are UBeful not aaly iat th« STUDENT 
prD'gr^in but for pbC»pli& trying to solve age prDbl?nis+ The classic age 
probleBi about Msry and Ann> given aboTfe, took an HIT graiJuato Student 
over 3 minutes to fOlvC: because he did not know thlfi hcuTtStiCr Wltli 
the heuristic he wSs able t* set np the appropriate equattons twuch 
HK>r& rapidly. Aa d trudc mcasuire of STUDEMT'a relative ipsed, note 
that STcrDEflT took Lesa than One mlfiMte to solve this problem i 



tflien All Elae Falls* 



For all the prpbl<nns discussed thtia far, STUDEtlT was abl« to 
find a solution eventually. In some clses, howevur> necessary glo- 
bal information is missing fttJin lt9 store of information., or vati- 
ables which n*mc the sam* object cannot be identified by the heuria- 
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Cics of the prosram. WheneT^er STUlJEMT taniwt find a solution for any 
reason, it turns to the quesEiatier for help- As In t tie probLcm 
below, it printi out "(DO TtfdU VHW ANY HORE RELATIOMSHIFS BETHEEM 
THESE VARIABLES)'' followed by 3 list ot Chu variables in the problem. 
The questioiieif can answ«t "ycS" lyt "nt>"* If he aays "yes" » 
STWDBNT says ''TELL ME", atw! chfl ijuesC loner can append another seft- 
tence to the statement of the problaui. 



(THE FEJOBLEM TO BE SOLVED IS) 

(THE GRDSS WEIGHT OF A SHIP IS. 20flOO TOlK$ . IF ITS MET 
WEIGHT IS 15000 TOtfS , tfMAT IS THE UEIC^HT OF IHE $HlPS 
CARGO Q.) 

THE EQUATIOHS HERE I»S(/PPICTEHT TO FINO A SOLUTION 



TEYIHG POSSIBLE ICIflMa 

(DO YOU KSIDW AHY MORE EELATIOMSHTP$ AHDHG THESE. VARIABLES) 
(GROSS HEIGHT OF SHIP) 

(Toas) 

(ITS SET HTIGHT) 
(HEIGHT OF SHIPS CAftOO) 



yea 
TELL ME 

(the weight of a shtpa carga Is the difference between 
the gross weight and the n^t ufrlght) 

THE EQVATIONS MZS& IKSUTFICIENT TJO FISD A SOLUTION 

(ASSUMING THAT) 

((HET WEIGHT) IS EqlJAL TO {1T& HET WEIGHT)) 

(ASJ^llMtHC TtSAT) 

((ySOSS uelGHT) IS EQUAL TO (GROSS WEIGHT OF SHIP)) 

(THE WEXOST OF THE SHIPS CARiSO TS 5000 TONS) 
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Jn this pTdbl«M> the additioTUl inforaation typed tn (In lower 
C3E^ lettETB) w^S avfflcient to boIvb Che probLeuir If tt 'Wfl* Bat, 
the fluestion would be repeated until tb& queat loner said "rtCi^\ QT 
ps?<yuideB aufficienC icifarmation for bpIuCIuo of the problem. 

In the i>riL>blem belaw> the solution to the See of tqua- 
tlorts Involves solvl^ig a quadratlt equatianj which is beyond tt>e 
i^athematlcsl iibtUti' i3f the present SIUDEHT aye C em. Note that In 
this case STUDEHT repnirta that the equJItLotiS u&re unaolviible, not 
S limply tnsufficl'ent Eoif solution. STUDEHt at 111 requests additional 
Infoifmatton from the ^ufistioner . In the example^ the questioner saya 
"no"> afld STUDENT sCStuS that ^^I CANT SOLVE THUi: NiDELEM" «n4 t e rml tia t ea . 

(THE FBJOELE^ TO ^E SOLVED IS) 

(TRE SqUARE or THE DIFFEEEHCi BETWEEN THE IRlHBeR OF 
APPLES ANP TllE HIIMEER OF OEAHCES (JN THE TABLE IS EqUAL 
TO 9 . IF THE KUMBtR OF APPLE5 IS 7 , FUfD THE NUHEUK 
OF OKAMGES OH TllJi tABLE ,) 

(THE EqUATTOWS TT) BE SOLVED A^E) 

(EQUAL G025i3 (HimBGR OF OHAHGES OK TAKLE)) 

(EQUAL (ISUMBEl OF APPLES J T) 

(EQUAL (EJLPT (^US (NUMEER OF APPLES) (MtNVS (NIMBER 
OF OEAHGES Oa TAllJii))) 2 J ^) 

UHABLE TO SOLVE THIS SET OF EQUATIONS 
I^TfOfG POSSIBLE IDIOMS 

(DO YOU KtlW AWi HORE EELATHJHSHIPS AMONG IHESE 
VARIABLES} 

(MiMEER OF APPLES) 

(HIMEEE. OF OEANGBS DM TABLE) 

t CAWf SOLVE THIS PROBLEM 
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H. g^^jwntary of Che STtf&jEJyE Subset of SnRUjhi 

The subaet of ErtgllBti underatandabLs by STUT>Binf li built 
atpvrid d core of aeciC&nec and phrase formats, which can fc* Cransfonned 
ifito CKpresslona in thfl STUDENT deduttLve model. On this bsEic 
tore ia built a larger set of formats. Each of the«6 Hre first crsnS- 
fonned into a »trtng built on formate in thia baalc i«t *nd; then Eh is 
ttring la t ra na f [jiriiicd Into an eKprc*Sion in the d&dutt Ive model. Fcr 
BitaiHple, Eh& format <$ IS EOUAL ID ^) is changed to the baale f&r- 
m^t (5 lis S), 3-ni the phraae "IS OOWSECUTIVE tO" Is thinnged tts 
"IS 1 PUJS". The ConsCmctiona discussed ear Her Involving a ingle 
object transitive verbs could have been handled this way, Chotjgh 
for programrnlng Convenience CTiey 1*5X6 not, 

l^e coRtplt-te Hat of the basic foxmatB aeee-pted by the pp^isnt 
STUDENT sjsce* can be determined by examining ClCi the prograffi lut- 
ing In the Appendix) the rules ftom the one lab-eled OPFORH tu the one 
labeled qSETn, The METEaR rules of the STOTEMT program precls&ljr 
a pec If y the accept able foriBaEs, aud their trane lotions to th« modal, 
hut I ahsll try to suranari^e the basic and exte^nded format* here. 
Iwiplieitly asaiuned in the SyntaK Is that any operator appears only 
within tine of the contK<tS Specified In the t^ble given in Chapter II » 
and only the operators given in the table appear. The lifting of 
STUtJEUr Starting at the tule labeled I&lfflS gives trartsUtione of 
sdditonal operators to those in the table. 

The basic linguiatic form %jbtth is trana formed into an 
equation Is one containing "ie*" as ^ tdpula . Ift* phrases "Is equal 
to" and "equals" aye both chac^^e^ to the copula "is^'. The 
auxiliary verbal Constructions "la mjultiplied hy" , "ia divided by" 
and ''ia Increased by" are a lac accept able aa principal verbs in a 
Sentence r As dlEcusaeJ in detail earlltr* a sentence Kith no 
occurrence of "ie" can have as a main vtrb a tranaltly* verb lamed i^ 
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flCely fQlliwed by a nianber. Thti number niust b* -flti element of th* 
phrase which is tht direct abject of tti* verb, se iu "Hary has 
thrcft giippiea". This type of trsntttiv* verb can «lso bav* a conipara- 
tive stfyctwre as d irctt object, e.ft./^Kary haa twice m many 
gijppi«S aa. Tom h«S Eistl". 

Ttsi* completes the repertoire of dcclfliiffltive sententc f^ymata. 
Any nura'b'E't' of ^eclarativB sentence a may be cot^ J ■o i tied , with ">a(l(i'" 
between ■aach peir, to fo-frtl e new (compLeK) declarative sentence- 
A declaraGivo sentence (even a cempleK declatat iv^) can be rn^de 
B preenppositlon for ^ quest ion by preceding it with ^*IF^* and fol- 
lowing it with a coeiFta and the queatian. 

{JuestienSN thfl:t is, requ«3t^ fcr inrorm^tioft fro« STIIOEMT* will 
be understood if they omtch ^ny cf the patteniaj 

CWMT AKE S AHD 5i (WHAT li $) 

(FIHD 5 AWD ?) (FIHU $) 

<HOW miTf $ DO ^ HAVE) (HOW MANY $ mfS t HAVE) 

<EMJH MAtT $1 IS $) 

Hits coDxpleCes Che Bumn^fy (ff the set o£ input fonviata present- 
ly under SCtHQ<3 by STUD^::^f'f- This fet can be enlarged tn two distinct 
waya, Onty ia to enlarge the set of bajli formats ^ uitng standatd 
eubroutints to aid in defining, for each new baaic format: > its Inter- 
pretation irt the deductive laodel. The other method of -cKtKrtding Che 
range of STUORpT input l9 to define tr^ms-forrflflttonE from nenf input 
farmat* to previotiEly underatood basic or eKtension f4>rfliata. In the 
next chapter we diacuaa how tbia latter type of eRtenaion can be 
performed *t run time, using the STUDENT gUbal information stora$« 
facility- A combination *f Efifiliah ^a^ HETEpR elementary (latteifn 
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elsmenta can t* used to define Che input for mat flrtd t rant foma C ion , 

Even if s atc^f-y ptoblflm is aCatsd with in the Sub-6'Et of Englieli 
Bccs^tahl^ to STUUEMT, this is not a fiuaranttt that; thts problem ceti 
he Bclvad by STO&EITT (fl-SSumin^ it to lue aolvflblft)* TWo plirases des- 
crtting the object itiuet be at worst only "Bll&htly different'^ by 
the CT iter is prtacrifeed eat Her, Appropriate global information 
must be available to STUDEHT, and the algebra involved mtist not ««- 
ceed the ablliti^j o£ the aoLver, Moueveif j thougih HioBt algebra stt^ty 
problems- foufld ifi tha Standard text a canncst b« iolired by STUQ^^ exactly 
aa vritten^ the ^ukhor has uaually been abl« to find acHne paraphrase 
*f almost all such problemB, which is golvibLe by STUDEIrT, Appendix D 
contains !» fair sample of the rafig,e of probLems that can bt handled 
by Ch« STUDEKT ayatem. 



I. Llmltettans of the STUDEin: fiubiet of EnElish. 

Tlri^ techfiiques prctented in this chapter are general and can 
be used to «nabl« ^ ConipTJCer prc^gram to accept and vfidersti^nd a 
fairly eKtcTi$iv« subset of English for a fixed semantic l>^«e . Hov- 
ever> the Currttit SlUDEHT system is estperimertal apd has 4 number of 
limitations r 

STUDEHT* S tntetptetdtlon of the input is baaed cm form«t 
laatchinft. If each form-it is used to eKpress the ineantng understood 
by STUDENT, no mis Interpretation will cccur. However, theae formata 
CCCur in English discourse even In algebra st^jfy problems » In sefrtanClc 
CCiilt4xt« not consistent with STUOBNt^ * tntfltprecation of these for- 
tltat*! For example, 6 sentence match tflj& the foTroac "<$ j AMD $>" 
£« «lvitys interpreted by SlUliEHT a» the conjunction of two dttlarative 
etateiTiefit.*. Tlicr'efore, the sentence "Tot^ ha* 2 applea, 3 bsnanas^ and 
4 pears." would be tti Correctly divided Into the fiio "SenCcncea" 
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"To" has 2 applets 3- Tsanaiias." and "4 psarB." 

EiCli o£ the opC^at'Qi^ words eh awn in Figure 4 muet he ujed ^t 
BTi opera tor in Ch^ COttCeKt ^s eh own oz a mlalnterpret^tion vill 
reaiilt. Tor cxarapltj tlie phraae *''the number of timais I went to 
the movies" whltVi should be Interpreted es 4 vdti^bl'B atrin^ vlll b& 
interpreted intorretitly as the product of the two variables ^'ninii^6r ct" 
aikd "I went to ttic mCJviefl"* bee a use "times^^ is always considere^3 to 
be en operetor* StmlLariy^ Irt the cuvrent iTBplamantntion of 5nJDE3!fI, 
"of" Is considered to b*t an operator If it is preceded by ^ny number* 
HoH-ever, the phrit&e "£ of th« boja who pa seed*' will be misinterpreted 
as the product of '"2" dDid "the boys irfio paeatd". 

Thes-e eKaniplaa obv£eu4l^ da nat cotiatitiite a complete Hist of 
alatftCerpretaCionE and BTTOfS STODENT will make^ but it should jfiT?e 
the reader an idea of 1 imitations Ort the SXUDEIif subset of Eit^lieh. 
Jtt principle, all of these restrittiocis toyl-d be removed. However, 
r«inOVtng eoae of them would require only mj,n&r ctiaf)g.«9 tQ the progicaap 
while other t IfOuId c^uire techniques not used in the curr4tt;t 
ays tern* 

For e3ci)mpLc> to orreet the error in inEGrprntlng "2 &f tilt 
boys ^o P'issed"* 0H^ e^s.fs simply check to see if the nunib^r' bffC^I^e tbe 
"of" is lest than L, ami if so, only then interpret "of" SS *(! 
operator '^times"- Howt^vcr^ a much amre eophisticEited gr:amni^r and 
parsiciE profiram would be necessary to disCinguish different occur- 
rence a of the format "(^, AHU $}*" , and correctly extritct Stmpler sen- 
tences from ^lompleK <a«rdLnate and subordinate s«nt«i1C«3f 

BtM^attflt of limit at ions of the i**t di^seribed above, and the 
tact that the STUDESH eyvttM c^iFxently occupies almost 4tll o£ th* 
computer menuory^ SIUDEHT iei^vea principally #s 9 dsaauttt^tio^ of 
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the pwftr of the techn.i<|T;i&£ uEilis&d U Itft Gonatructtdn. However, 
I bfelLft^e that OB a largBr computer &nc ftoul-d uae ttie^o techniq-ues 
t* construct a system o£ pracClcal value which would temnunicate 
well Ifich people in English CTuer Ch* linBited ratiee of material 
UtidefSfcood by the progriLm. 
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CHAPTER \f; STORAGE OF GLOBAL IMFQFlWflTION 



Tfit* alg.«bra prob Lem.' SO Iv tna ByBtem ccmtatna two pTogTAiBS 
which pf««4lt9 Bnfiltfih input. One is Cha probl^n tbus far discu^s^dp 

STUDEHT, **fitth accepts the itat&taetit of an »Lg«bra Story problecn and 
attempts Co find the BoluCion C* the: partieuUr problem. STIiDENT dots 
not Btore any infoifiBation, no-r "rerti?mber" anything itotn p rob Lara to 
problem. The infoi-crtatloti obtBined hy &T!JDENT is the l-Otal tontext 
of th& q Lie B t i [>n IK 

The (Jther pfc^graiTi la called: EEHEPIBER and it process as and sitores 
facta not spei^ifi:^ to any one problem. TheBe fBots ■make up STUDENT' e 
atore o£ "gLobal information" «l« opposed to "locaL infomtation^^ 
sped fie to the problem. Tl^is Inf * ftiia t Ion ia accepted tn a aubaet of 
Enslish which yvei^lapa but ia diffefCtkC frooi the subset ot EfifeliEh 
accepted by STUDEHT. KliHEHBER accepts statements tn cettAin fixed 
fermatsjand for eiich itjria^t the informattun. Is stored in 9 wsy that 
TTiak^a it convenient tCK retrieval ^nd uS* with in the STUDEHT prtigram. 
Semte information is 4t«r«d by actually fldLllrtS METEOR rule* Co the 
STUXH?HT program, ^n4 ochtir Information is StOifed on property llata 
o£ individual wrdB, which are unique *tamS tfl the 1,1£P systcin* 

Itie fQllo-winE sre- the format a currently Mtidereto-od by ROfEHBERt 
Eind the proceeaing nod inforfflation storage tachnlqLifcs used for 
each ont: 

1. Format: Fl EC^ALS Pi 

EKffmplc: DlSTAlSCE EQUALS SFEEB TIMES IIHE 

Process iflg J "nie s-entcnce is trane formed Irtto an equation tti 
th* *afiie way it is done in $TUBEHT. This eqi^atlon is atot*d an the 
propflirty lists of the atetas which ^re ch* firat words in each 
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variable. In th« B}t«lpl4» Cl;i6 e^quatif^n 

"(EQUAL (DISTAKGE) (TXMES (SPEED) (TlUR)))" 

ta a tor Ed on ttic property Ust* of "UISlWdCE" , "SPEED" and "TIME", 
If any one of ttietfe voTtin dppc^r^ ss ths^ InlCial uord of a variable 
iti a probLem, and global flijuaeiotia art ne&d&d to solve tbla problem, 
thia .equation 'will be T«ttiQve4^ 

1. favtaat: PI IS AH OPERATOR OF TLEVEL K 

ElcsDiple; TIMES IS AS OPEEIATOR OF LEVEL 1 

praceeeltig: A dictionary entry for Fl iJ created, with eub- 
iCtflptft cf 0? and li. For TIMES ^ thfl cfi^ct Senary etitry (tmSS / OP 1) 
it created » The dictionary entry for -any vpord is plated &n th^ 
property list of that uiord (atooi), and is cetrieved ^nd uS^d lA 
pls^B of any ^ccyifrence of that word in a prc>bieni+ 

3* FomiJitt Fl. IS AH OttRATOft 



EKiimplor OF IE AM OFtK/Lmw 

FTdCfl^sing: A dictionary entry la created for Pi with the mh- 
SCtipt OP, l^ft entry for QF la tOF/CF) ■ 

4* Format; PI 1$ Ji. P2 

EKamfkU; BILL IS A PEESOH 

Process Ingt A dice tana ry entry ia created for PL with autK 
BCtipt F2. The cntr^ fcr BILL ia (fiILL/PESSO»), 

5. FotrstatS PI 15 Iti E PLURAL QF P2 

Example: FEET 1& THE PLUhAL CF RJOX 

Protessing; Pi la atored on the piropeTCy list of PI, a £ter 
tbc flag Slp:{^; the word PL is stored on the property list of P2 
after the flag FLI^IKAL. Thus FEET is stared lifter PLURAL on th« 
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property list of the atom POOTi, 

6.. Format J PI SOMETIMES MEANS P2 

EKampLej TWO HLKEEftS SOMETIMES: MEAMS ONE MUMHER ,\70 THE 
OTHER NLMBER. 

Proceaaing: ^e STITDEHT pragram la modified ho Ch^C an Idiornatlc 

subst ItLition of P2 for PI vlll be made In a prQblea if It is otYii^t- 

vise unao-lvable. All auch ''"pc^aalble Idiomatic Etibstlt^iClocte" st& 

Cried vhen necesBary, vich the last one enteirad baing the fir at one 

tried* The STUDENT program la mtidifled by the addicioti of foo.t fttw 

METEOR rules. Since PI and P2 are inserted as left and right halves 

o£ ^ KETEOR Tule, they need not contain only words, but ooa use the 

METEOR eleirtentFiry paCterne to apecify a format change Ifi^te^d «f 

just ? phrase change. For the exacnple ahown, the ralea odd«d to thu 

STUDEHT program, aa liated in Appendix h^ are the rule lab<l4d 

C025lt), the rule folkmins. that one, the rule labeled G025H and the 

rule follouing it. 

7u Format: PI AlWAY:^ tMims f2 

EKample; ONE IWiU ALWAYS MKANS O.Ii 

Processlns: The progrflm SIunKHT i* n*odifi&d so that If Fl 
Dccmrs ^ a mandatory a^ibst ItLitlan of F2 for Fl will b^ ma^e in :any prob- 
lem. The J^tat aeptiinct- In thts format processed by REMEMBER will 
be the f trSi: mdnd^tory Subst ttut lor mitde. Thus "erne ylwaya meana 1" 
f p 1 1 owed by "pne h^ilf rilwiiya me^ina C .5" will cause the desired aub- 
at it ut ions to be m^ide; if theae aentences vere reveraed no occurrence 
of "one half" would ever be found since it would have been changed 
to "1 half", by mandatory substitution of 1 for one. 

TtyT fraeh 8eriee-nit& ip. this fni^at pftm^BS-id hy EEMEMEER, a 
tieu KETElJOK rule la addgd to the STUfiEN? |tirograrri ,, inovied lately fel- 
loving the rule nanied IBIOMS. ^e format of the METEDR rule added 
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•is C* <P1) (P2} IDIOHS) wh'Cr* Pi 5nd P2 3 re the atringa tfi th* apfi^ 
tecice proctaaeii. Thu* by u*tTis =■ combination o£ Engllah arttt METEOR 
elementary patterns and referflnce nunberB in PI and P2 j one tan Itdd 
a new format of aeotenct to the STUDEHT repertoire. Fen? ej^am^lci, thfl 
following atatcment uaa fvoci-sstsi by EEMQIBER to alloH $"njtlEHT C* 
"underatand" {properly tranafefiiO s sentence in which th« laaifi vittr 

(5 E3tCE6fi& $ BY ^ ALWAYS MEAHS 1 IS 5 MOftt 'JllftH 3) 

Hi is peroiarteftCly ex tended the STTTDEHT input aiibaet of English, 
iihlle avoiding th^ n^c^^Sity o£ JkctUi^lly editing and changing thfl 

The global infonnatloii stored for STU'DiBW't ranged from equa- 
tions to itjtm^t changes to pluriil f ornts . Ag.ain, Che cion|»atlbLe use 
of the METEOR prototype notation and the use of the generS L Liat 
processing operationa in TISF facilitated pra^raimiing of proceaaing, 
Storage 4nd retrieval of thia vide range of. ifLf<ik^aCloni In Appea- 
dlK C I5 a listing of the globs 1 information cmrrently ento^ie^ in 
tb* SIUDEHT SyitBH, 
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tm&PJSSi fl: SQtTJTIOH OF SlHlfliTAHECHJS EQUAnPMS 



Thlft chapter eontalna a rfeaciftption of thft Ll&P program 
ii««d: by STUDEHT tu *&Lve sets of stuiultatieoiia aquOttonB* The de- 
ftalttons of th* thr*e top Level furic Clone SOLVE,. SOLVER and SOLVE 1 
are shown in the £t$wif* at the end of th±Q chapter* Thts iJeacrip- 
tlon of theEB fuTitttoaa is e-BBentially independent of ^ d«t-ailed 
HnowledgE of LiSPi alChouEJi occasional parentheticltl Cfln^etits will 
be diTEGE&d to tlit naore fcnowle'dgeable.. 

The top l&vtl funetlQTi, SOLVE j Is e function o£ three flrgli- 
3n«nta. One, UbtUd BqT tn the definition of SOLVE, is the set of 
equations to be ioL-ved. The ar^untent leveled HAHTED tfl the defini- 
tion is a lt*t of vaif tables who«e valtiea are wanted i Th« thlifd 
ar^i^unenc, Ijibalod TEHHS , ie another list Cff variatle-e lAith Is die- 
jolttt froQ WABTED* SOLVE will fin'd the value of any variiible i^ileli 
Is wanted in tetmi of any or all of the vaciablea on the liet TERM^- 
la u9e> the list lEKHS ia a Liat of uCiit^ » such as pounds j. or f eet j 
which 'niay appedr in tbe anawer. 

the output of SOLVl. ia dependent &il whether the set of equa- 
tions givtft can: be aolvcd for Che var tables wanted. If no solution 
can be found becauae the solution involves notiUfiear proceaaes, SOLVE 
returns with the value UNSOLVA&LE . If no soluCtotl ia found becaute 
Txat enougji eqaattona are ^Usti, SOLVE returns with the value IMSUF- 
FlCIElTlt If however, ri solution la found, SOLVE returns with 4 lt«t 
of pairs* The firat el^tftt of each pair i* a u-ar table, either oti the 
wanted Lts^t^ or a variable whoae v^lue *rajs foutid while solving for the 
d««iTed unknowns. Ihe second Blfrm^nt of each pair ift an arithiaetic ex- 
preaalon (in the prefix nocatloti ehown in FigtLre 2), which contains 
only number a and variables on the liist TEPHS-. Thua ^ the an4Wfl1f fav^ 
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by SOLVE is sa "aaaociation List" ef variables, atKi Eheir viluea 
In the praper terms. 

For exinmple, let ua conalder the aet oi i^v^-n simultanEoiiB 

equations shown be lav, and auppoae ^LVS w%r« d^kad to solve thlB 

set o£ equations for x ami s- Theae aire gtv*n in laflK notation 
for ease of reading, 

(1) X + w = 5 <5> X + 2y = 4 

(2) x^ - C = D (6) y^ - 3y + 2 = e 

(3> C + 3D = 6 (7> 4k - y = 7 

(4J 2C - D = 5 



Vi[S Liflt TERMS ts crmpty, iSnd thus the valuea muat all he num- 
bers. In thla cage SOLVE would return with the list of paira 
**((yt '^)(.^t 2)Cti 0>)>" D^iich indic^tBE that the values x = 2 and: 
z = satisfy tliia set of «quation« (or those meiBbers of this 9«t 
iihlch were used to ietenutne the v4lues>. Hie value y = 1 was 
fcynnd during the solving pt^oce^Sr 

Mast of the work at SOLVE ts done by the function SOLVER* 
SOLVE transBitB to SOLVER the Ust of WAHTED variablea, the lUt pf 
TSKMS, and a null esaociati&rt list (ctfLLed ALIS) -which is fftCur- 
fltvely built up to glvfe the answcrt The value of SOLVER is this as- 
sociation Hat of pairs, iflth the first element of each pair 
being a variable who a a waluft has been found. The second «lenMnt of 
each is an arithmetic CKplfEiiion which may contain any vavtablB 
on the list TEW^ (a? wa« th* C*ee f<>r the ALIS of SOLVE) » However, 
it may also contatfl variables which are first elesient* of p^lrs 
later on the association liat. l£ values for variables given by 
Ister p4jirs are substituted into thia arithmetic expression^ ane 
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gets the drithTne-tic express i[>n given by SOLVE Ccrntdlniil^ Aitl)^ 
variabLea on the l±at TERMS. In the c^Jimplc, SOLVER wjuld r»- 
turn wltli the flsaoclatloB, liat Hy , (4it-7)) (kj2) (ziO)) whlth 
gives y in terms of x^ SOLVE m^ikea the eubstiCuCicTiE and sinipliflca- 
tioif on the association list returned by SOLVER, 

90LVEB is R program, uhich solves for s lisE of wanted 
variables. Xt does this by choosing one of these variables, adding 
th« others to the Hat of temra and calling EQLVEl to solve £or thia 
one variable In terms of the other vjincsd variables 3nd the oti^in^l 
TBBMS. If SOLVE I succeeds in Solving for Chis variable^ SOLVER 
pairs this one voritfble with th« «xpr«3»i<rn: founds put» thts pair 
an the end of the JliLIS , jind using this substitution in evety equa- 
tion it tries to solve, itttemfpti to Sclvc for thii tem^inin;g vant«d 
variables. If there iire no more, SOL'VER is finished ^nd retutnJ the 
association list built up i 

SOLVE 1 solves for a single wanted variable by finding An 
equation, containing thi.s vdridble, ^fte-r all fiublt itutions of 
values for variable e listed on tba ALJS have been madsr It then 
■ekee a list of all the ctfaer v^ri^tles in the equation, And che^lfs 
t<^ see Lf there are eny nnt on Che list lERHSr If so it calls 
SOLVEH t^ eolve for thsBS nev verlebles in terms o£ the wanted 
variable atid Che variahlea in TlfiHE . If SOLVER is unauc ce b s f ul , 
SOLVE 1 tri«:S Co ftnd anather equation containing the wanted variehle, 
and r^p&ats tb& process. If there is none^ SOLVE 1 has the value 
INSUFFICIEHT' If SOLVER ia Hucceaaful, and values for these new 
variables are found* or if there were no new variables , SOLVtl 
finally tallt SDLVBQ -wbicb stCeanE^te to solve thia equation fox the 
uanted variable. If the equiJtion is linaAr in thife v^ri^ble, 
SOLVPQ will be auCeeSSful and glvt a ^plutlOTl. SOLVEl will add 
s psir consiiting iri iph« ikisnt^d variably and this value to the ^nd 
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of ALIS, uad return vith this augja&nt^fed ALIS «s its valve. If 
50LVEq la unauceeaafuLj SOLVEl Cries artOther equatiai^^ l™c then if 
aa ftolution c^Jti be found SOLVEl reCurne the v.-iiue UHSOLVABLE, 

This deacription ti^S been a raChtf long-winded att^n^t to 
explain Che one psge epf LISP progTan eC ehc end of thig chapter . 
to Qiakc it more apeeific* Let ua eons id et whst happens wlnin SOLVER 
tirle* to solve C>it i-tc of eqyationa b«low (the BaD& c»i1ftd ahcnm 

earlier) 5 



(1) 


X + V - 9 


<2> 


K^ - C - D 


C3> 


C + 3D = 5 


(4} 


2C - D = 5 



(S> K + 2y - 4 

(6> / - 3y+ i 
(7) 4k - y = 7 



SOLVER is asked to solve for x and £. It asks SOLVEl to 

a<ilv* for K in temia of j> SOLVEl picka equation (L>. find^ that 
a in**r v^ri^ble, v, haa appeared and aEks SOLVES to aoLve fqr v 
in t*11nS of K and z. ttnct there is aa other occurrence of w in 
this act, SOLVER ia utisutteBsfuL and SOLVEl abfindona equation (1) , 
and goes to equatian (i) * Here it calU SOLVER to aoLve for the 
tw* new variablea C and _& in terms of k and _z. In this case 
SOTiVEH i& auccesiBful, using equstiooa (3) and (4), Imt whftn theae 
values 8 re substituted In equation (2) » SOLVEQ cannot aolve for x 
because the equaticin is noC linear in n* 

SOLVEl now abafldofl* Aquation (2) and the results iC obtained 
aa subgoals for solving (2). IC finds aft occurrence of k again 
in t3)- Again it call* on SOLVER. Co eolvft for the new variable 
2. in tctBE of 2t ac»d z* SOLVER tcies C* use (&) > but EWLVEq cannot 
solve thi* equation for ^. Using (7) SOLVER returna with an ALLS 
of Ct¥,(4K - ?>)>- Uaine this ALIS^ subatitutlng thia vaLne for y 
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into (5), S0LVE.1 callB on SOLVEQ to solve thia equiJfioTi for x^ 
which it i3ofcS, fltid finally SOLPEl r&turna to SOLVER the JiLIS 
<<y^ C4k - 7}K<3f>2>> which 4oe* gtve the Tfalue o£ k in e^i-BS 
of ^, Having fckiirad x in tenna pf ^i £0LV1?1 will noH t:*!! SOLFEl 
to find the value of s. SOlVEl f irtd» an DccurrencE of s iti 
equation (*)* and after subst itut ton of tenM on the ALtS, SOLVE^5 
ts able to *oLve this equation fot *» becay.ae it la lineSr in i- 
Adding the pair (i^O) to the AKESi SOLVEl vetuma It tr> SOLVER* 
which pass** .^rt this ALLS f(y, (Ak ^ 7)), (jl,^), <z,0» to SOLVR. 
SOLVE, y^tfia the function SUBOEDs which Btih a t it lit ga in ord er 
pairs on an ALIS into an expression and slmplifieaT. fin^LLy tetuuflfl 
the ALTS ((? s 1) Ck,2)(z ,D)) - 

TfiiS -CKaiaple ahowa Che rather turtu&iiS recursions that thflS« 
fimctionB uSc E* aolue. a aeC of equations + ^y should we use thi* 
type of BolTlng pfogra* inaCead of a more etraightfoifwaTd oiatriK 
niethcjd? The- principal reason is th^itj. ag showfl, nonlinear equations 
nay appear in th& set> Iti thia case, if appropriate valuaa c*n bn 
found froni other e^uatlotva which when Subatttuted into this non- 
linear equation make it linear in the Tf*riabL« iov uhich we want to 
solve ^ then SOLVE will find the value of thi* variable. 

The fflftthod of operation pf SOLVER vequirea chat if n vari- 
ables «p|>ear in any equation* and that equaCion ifenSfrdj thapat 
Le*st n-l ether independent equations containing these variablea muat 
be in the set of equations, or the actual laechanies of solving will 
not ta started. This eliminates ftuich work if there are txtraneoua 
«quatiOTV<t In the set which cofltain one or two of the wanted variablea,, 
How4V4r, it precludes solving a set of eq nations whi&"h ia homo-- 
geneouE in one unwanted variable , and woulnii therefore cancel out 
in Che solution proceas. This is th* prirtcipal reaaon why probleiris 
Buoh a s : 
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"Spigot A flllfl a tub In L hour, anii flpisot B In 2 
hours r Hpw lofg do they t*ke togcth*r1" 

Cannot be aclved hy STUDENT. 

TtiiB Bolvirvg iybroutlrie aet La an inds*pKrtd.ftftt package in Che 
STUDEHT pi?og!(?ai*. Therefore, iraprovenenta can be msde to It wltVi&Ut 
disturblfia the rest of the proceaalng^ The rout loe tluscrlbfed 
hijrc was dsflifitiefd to handle mo*t t>f the probleiii£ th*t can ^* f&ynd In 
ftrat year algebra texts ^ 
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GUAPTEIt Vt^ ; CONCLnSIQN 



k* Results. 

The purp!;:ia;« *£ the research reported her* v«« to dewelot* 
Cftthniquiea which facilitate netural l^jngaase cpmnunicaCian -with 
■a Comput ar . A semantic theovy ct -CohBrent dlsticiurfle was ptopoEEd 
i* 4 hsBiE fnjr the ■deaign and undeTstaTiding uf such ma n- mach ine 
systentB. This theory H^aa &nly outLtnedij ar\^ Hjuch additlpniil work 
remstna to be dene. Hav&v^r, in it a present rough f<mn, the 
theory aerved a% ^ guide fot Const met ion ef the STUT>eMT System, 
which cart, cmrniun icii te in a limited Bubaet of English. 

Hie lanf{:uae6 analyaia in STUDENT is an Implement a ticn o£ the 
imi) lytic portion of this Eh wry + The STUDENT systera has a very 
narrow a^manXic base. Froia the theory it is cteiir that hy utilialns 
this knowledge of the limited range of meaning of the inpyt diecoLiret, 
the parking problem becomea greatly e iuip 1 i f 1 ed , since thn number Cff 
linguistic faxim that muBt be retpgnized is very small. I f ,t 
parsing flystem were baaed on any small asiiiantic base, this aame sKh- 
;pli£iGat±on v&uld Qccur . tlila suggeaCa that in s general languagfr 
proceeaaT, aonie time might hi; ^pant putting the input Into a semantic 
contest befiot* going ahead viiith the syatattic analysis + 

The eeitiantic base of th* ETDDEJIT language analysis is delimited 
by the charscttriatics ot the problem solving syati^ tTiibedded io it. 
STTTOEMT is a (JUBsCion-anetoering aystem which STisv&ti <3iieatiofts poaed 
in the conteKt of '^algebra story problems*" In the introdttc t ion , 
KB Liasd tQvt criteria for evaluating aevcrSl <]ueat it>n- Answer ifljj aye- 
tema. Let us compare the STUDEMT syatem to these e-thera in the light 
of these criteri^ia. 
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1) Ext€Tit o£ Understand 1tik> All th* othar ^ju^atlon-an- 
swerlrvg eyacems Jiicu^aed analysi: input eenCen.ce by senEence, 
ALttvPufiVi a reprasenfcatl^-rt thf Che meanlne of all Input strttencea 
msy be placed in e«rtfe coomon st{>re, tiO ayntactic confleeit ion Is 
6v6t made between e6ftC>Sincea . 

In the STUDEHT systemj an acceptably Input is * i*qa*nce of 
Bentencc*H such that thie** sentences cannot be underEt^Jonl Ify iuat 
finding the raeanin^gs o£ tbe Individuiil senCencea, ignpflns their 
local cont^JXt. InCET-Eflnt^nce dependencies muat be detenniriftdj and 
Inter-sentSflce aynCactit rftlaCionahips imu-St be used in tht? case for 
aolutipn of Che problafn filven. This cKt*snslon of the syntactic 
dlnienEion »f underst^^nJ Ing la Importimt K-:caiise such intcr-aenCence 
depend entite (e.g.,the use df pronouns) irti very coiroonly used in 
natiirol 1 Assuage communication. 

The aemantic model in the aTODraiT syatem is based en one 
relatiotiehip (equ^lit^) and five basit arithmetic functions. Comr 
poaiti^m of theae fntiCtiona yield other functions which are also 
expressed a a individiisl linguistic forms in the input language. 
Ihe input Unguage i* rUber in eTtpTessifift ftmcCions than Lleidaay'a 
or RaphJlel*? system, thi logical eyetCtflS diacuEsed may have more 
rtlationship* (predic^C-es) allowable in tht input, but du not allow 
any cofiipcSlCion of ctieSft predicates. The logical ti^mbtnatiofts 
of predicsJtes uaed are only thoae expressed in the input a^ logical 
Hji^nibinatiOrtS (usin.g *nd^ or y etc.). 

lb* deduce ive syseem In STUDEST, a a in Lindsfty's and Raphael" a 

prn-gr^B, la dealgned tor the type of questions to b« aaked . It 
can only deduce 9n*w*rs of a certairt type from the inpuC informat i^on, 
that i», aTithmetiC nalues sii.tistyifg a set of n^uaClona. In per- 
forming ita deJiidtiona it is reasonably sopll let icated in jvpiiing 
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iTTslavanC infomi^Ci^n, us iirc- the other Cvo nenElansd. It Lack$ 
the general power ot a Logic*).! system, but Lb SAich inore efficlignt 
£n cbCiSinin^ it* p«rtic;ulAr cl#»« ai deducticmB Checi vould be a 
general deductive iyitem utiLlzing the aKicrnke of eTitliJuetie: > 



2J FaciliCy far Ejctending AblLtttes. ^ictendlft^ Ch* JJfldtactlt 
abilitieB of eny of the afher eitteetion-£A5ueirI.fi|( iyStiibS 41etu9Sed 
would require reproBrHuiBing. In the STU50EHT Jjyitcca new 4tfinitional 
trains fariaaC ions nan be intr^hduced at rwfl ttrrte vtthouC any reprograTii- 
mlng. The tfiforHwatlon cotveertiing these tvafliiEatriifletcnS cSn be in- 
put i-fi English, ^K in a ciombi-natic^n cf English 4nd METEOA, if that ia 
more appropriate. Hew eyntattic trans forma tlona imst be s^Sded by 
extending the pragTem. 

^« seiAantic ^f4 of tht InlUDEtfT «^St«m C#ii te extended only 
by addlrtg rt«M pifonifatiij es Is tr^t of this oth«r question- answering 
ayatema dlscMsa^d- Hflwever STMOENT 1* urganta^d tO £'${ilil:ace 
euch extenslona^ by mlninslKlmg the interact ieo* o£ different parts 
of tti« |>rograTTi, Th« Ti«««4«^ry inforui^atiDn need only be ^dded to the 
prsaram «£ivival«At cf tlu; table of *pftr*tpr» in Figure A, in Chap- 

SimtUrly^ the deduOCivs portion of STUDEHT^ ^ich aqlvoj Ch* 
d«riV6d set oi equatlonj), i« 4n ind spend ent package. Tfierefare^ a 
tlWH Kil: ended solver can b« 4dld«d to the system by just replacing 

Che p^tk^ge^ dnd mfint dining the input -Qut put characteristics of 
this s Librout 1 ne * 



3V Knowledge ot Internal StruoEure Se.eded by lJa.er , Very 
litCle if sn^ tntemsl kncwlcd^o of the vorkinga ot the STUIJENT 
syst&a need be kni:>wn by th« ms^it. Uc pvubc have a firm grasp cif ths 
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type of prtJ-blem that STUPEiffr tan eolve^ and a knowledge of the input 
granmar. Tor ex;nnple* hc must lie awar* tha;t the asjoe phxHse imist 
always be used to tej^reoerst the same -variable In « problem, wittiin 
the limits of aimllarlty defined earlier. He ebubC realiae th^t 
even within thcfic limits STUPENT will not recoEnize more tban one 
variation on tf |>tit«sc« Sut If the U^er does forget any of tbese 
facts, bo c4n *tlll uii the ayatem, for the IriCeractlon dlaeussed 
la the iijQZit AeCtiOA aH^^rw^ liln tP make amenda for almaat Aay nLiEtake« 



4) Interaction With the UJi&r. Tbe STUDENr system is embedded 
in a time- sharing ctivtcoiiwerit (tlie KIT Froject MAC t Ime- sbar ing 
system (13)), and thi* ^Ha*tly fatlUtates Interaction with the 
user. STTIDENr dif fexentiateS bct**6trt Its EallMre to solve a 
problem because of its buS theitia t i ca I Jlfflltatlpna and failure from 
lack of auf fie tent info'rin«tioti+ In case Qt failure it aake the ua^r 
for additional infoxroation, Sid suggests the nature of the netd&d 
Information (TelatiimsblpB emong variables a£ the problonj . It 
mn go back to tbe user repeal tedly £ot ttiforrnfldoti until it baa 
enough Co solve tbe problem , or until tb.« u4#f g,ivef tip* 

STUDENT al»o reports when It does not retci-f$ni£6 the femat oi 
an Input S entente ' Halng tbls Infomuition $S s gu-lde* tbe Mter Is 
in ii tcachlns^macblne type situation, Snd Citt 'quickly learn ti^ speak 
STUDEMT's brand of Input Enjliah. By mOnltOTlng the asstHnjJttQns 
that SrUDEHt makes about the input, and tht global Information it 
asei, the uaer can atop the EyaCecn and reword a problem to avoid 
an utitranted ambiguity, or ^dd uev general Information to the 
gloV^l infoi^iKatlon a tore. 

The cmciiil point in this user interaction is ih^t STUDHIT is 
embedded in an on-line time- sharing system, an J can tli^ia provide eaore 
interaction than any oi the othct s^stemB tuerLtioned. 
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B * EKt ens to iia ■ 

The preaetiC STUDE2JT ayatem Ms reached the [nSKiinuiB aiae allow- 
*blt in the tlSF Si^StttCT on a thtrty-twi thoueaoid w«r<3 iHi ?0g4. Thera- 
f&rft, very l£ttl« tan be added directly to the pce^-GnC ByBCem. All 
til 6 prograTranins esteitsions menC ianed Here are predict t-cd on Che 
enlEtence of r mucti larger memory aiachine. 

Without inventing any net/ techniques ^ I tbtnk that the STUDENT 
system could bt ms^s to understand moit of tha aLgttr^ story pvoh^ 
lemc that appeap Iti first year higb School C:e}Lt baak* + if new 
operators^ a^V toral^inat iotia of a r i t hmiet ic operatipnS occur. k they 
can easily b* a^d^d Co OPFOBMi the subroutine utilcJi maps the kernel 
Engliah 9ent«iiC«« into equatiocii„ 3!he nmnheif of format a recag- 
niaablt in the system can, b« intreaaed without toprograuiBltig 
Chrough the machinery available for storing global informaClofli 
(this 1*89 discussed in more dflCail in Chapti^ir V> + The pT^iblens it 
would viat handle are those having exceaaive vtrhiage or implied 
infacniation ybout the world not eKpreaaiblt ir. n single sentenCflj 

f[S uenClone^J earlier^ the System can now tnak.e uSe of any ^iven 
schema only once in solving a problem. This Ig because the ethemei 
equation la addfld to Che aet of cqu*tions to b-e Solved,, and the vari- 
ables id the SCh^cna only identified with one ether est of vari- 
ables appealing in Che probltWt For example » t£ "dieCance ciju^lg 
spaed times time" were Cha schefliaj then "distance" » «e a vaciable 
in Che schema might be set ^h^u^I to "'diatance tr^iveted by tcain" 
or "^'diBtaeice traveled by plane'% but not both in the aarae prebUn. 
EiiB problem could be resolued by not adding Ch& schema equatton 
diraccly Co tht Set of equations to be solved , Ipit by looking for 
conEiBtftnE sets of variables to identify with Che schema variables.. 
Then STUDENT ecu Id add an inatantc o£ the scheaia cfju^ciona, with the 
appropriate substitutions, for each tonsistent set o£ variableB 
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fi>Lind which *1t* "siBttlBv" to the schema variabLea* 

At the mament the floLvlng aabrouttne of STUDHni tan -ftttly E*er- 
fcsrm linear operations en Literal equiitioTi** attd aubatltiitionS *f 
nunibeps In pDlynoiniala and exponentials ^ It wcnild be relatively 
easjr to add the facility for soLvinfi qujidr«ici& Of even higher order 
irsLvable equations. One CiSwld even add, quite easily ^ aufficient 
TiKCkiaii isms to allow tUt soLvei? to perform the differentiation needed 
to filo i:«lated rate problwro* in the differentisi 1 taleuliis. 

The BemiinCic h^Hs «£ the STUDHiTT ayateui could be eKpaiided. Irt 
order to add the relattCttS r ecoETiized by the Sift syatem of RiiphrJcl, 
for example, ore would havft to- add on the lD(w*»t level of Che STUDESl. 
program Che eeC o£ kernel eetiCetvcea understood tn S1R> Cheir ranpping 
tfl the SIR HMdel, anJ the ^tieaCion-anaweting fOwtlfte Co reCriave 
facta. Then the Sppasfat^e of the STUDENT ey*t«m would proceaa muth 
more CQcnplic-site^ input e t a t esnent a for the SIR rnodel. Otie aerious 
problem which SriSftS When the aemantic tl>-S^3 t* ei£ tended ia based Ofi 
the fact that one ketciel may have an int-Etp rotation in terma of two 
different aemantic biiscS^ for eKamplep "Toih has 3 fiah." can 
be Interpreted in both SIR and the present STODEffT ayatera. To 
fifSOlve thifl aemontle flmttguity* the progra™ can checfe the context 
of the amhiguoua atatcmtflt tO see If there has been one consistent 
model into which all the other statementB hSve been processed. If 
the Latter condition doe* tlOt determine s single preferred inter- 
pretation for the Etatemflilts th*n both interpretations can be etorad » 

In addition to theae iraoGdlate extensions of th* STDOIMr ayaCeni, 
OUT scmafitic CheoTy of diGcour$c; can be used us ;i ha^ti for a much 
ACi-re geneTsl language processing ayatcin^ As a start, one could 
Imp lenient the generative granraar destrib^d in Appendix E to produce 
coherent 4i9£^*Lirae — problema splvatle hy the STTJDEHT system* 
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An[>ther more exciting possibility 1b ta uElll^t tills t^c of speiik- 
er's model of the ^?orLd to attack Yn^e^c ^'l^aB^bsll, stic'tOultt^i'" pfEnb- 
lem« The baseball amuouiiceT haa ceiftalti ^ rgipciiS 1 1 lotti fid^e^ tQ tiis 
world model from the events he perceives , l^fr. Cht b^S?b^Ll geise he 
is watching. Mendatory application ttf c^rXatti ttmanCit rules add 
other propositions, and delate some Chat are tller&* Hhile thaaa 
changes are gotng on, the announce r is t* littsrate a ruTming coan- 
mjentary (coherent discourse J describing thta tall game he ±a watch- 
ing. By makirvft the proper asstiMpEltiiis als^ut iwhtre the tttentian, 
of tfie aniuDunceT is focused, that is, wVii^Vi t>ropt>S itions he is 
going to use as a ba-ee of hie diec^Mirs^ st any tljti^» t feel th^nt s 
recfsonable facsimile cf an anncitincer caa b& piroKri3inntd+ This is , 
of Cout&e, -iTi emipiTically tsB table 1i yp athaa is . 

j^nother uae fcr thia oudel for g^ntrati^n and andlyi&i^ of 
di^couTSQ is as a hypcthasia about tha linguistic b^h^vlour of 
l^eoplek Psycho IcgiBts hava built raasanabli^ c^nput^r models for 
hunian teh^viour in deciaion making (17)j verbal lAatnlng of noneenae 
Sylld'btfiS (15) 3 iind seme problem solving situati^nt (34) » STUI^ENT 
iqdy t>Q A good predictive model for tht bebsviour of people when ccn- 
front-ad with an algebra probleiD to solv^« Tiiis- C^n be tested, and 
Su<ih H study aay lead to a bet t fir Lji^d^^ireCandlng of human behaviour, 
and/cr a better reformulation of this theory of language proceaaing. 

I Chink we are far frcm writing a fstrcajjUfam -which can underBtand 
4ll, <?r even s very large segment of English ■ Bowevar, within its 
nsiirtjw field of cumpeteucej STUtlH^fT ha? demon* t rated that "undsr- 
Standing" machines can be built- lodemd, 1 believe that uaing the 
techniquea developed in thia vmRVi^t one could construct a eystem 
of practical value which wpi^ld temmunttite well with people ici Eu- 
^Msh over the r^nge of material understood! by th« progracn. 



lOJ 



AFFEMDIS Ai FLOWCHART OF TH_E STUBEjin PRCHSitAfl 




THE PKQHLEr-l I 



















I M-fCiftrr^Ti fiiy 




SOiV£ 






C 



4T0P 



) 









Kf ONE 
Slf^iTlTliTli 

THJl 




Cet JNfWl-; 
/\jrii«.ft t 

AJ^ifllvS li 



Efd^T FAlUJBl 



104 



APFgKDK Bi LISTING QF fHE g-TUDEMT PKPfiEAh 



1) DefinicUn of SrUDENt 






dm* 



X Q h* ^ > a. ^ 2 Lu m i-i 



M V* Mid- 'C -^ 



«*. X in M 

_ivt I- — -^ i-J 

■ui — E T fc- 



* ™ If k» " 

w •# 3 " 

Mi i-r ^ in VI- 



OV -^1? K —ID. 

W bj & — -ua D u 

_ .... I'TAm -^u-^ku 

■-F_lwh- » -S. ■ r-1 <■ ;^ .^ r^ ^11.=Kt *«0 E 

D_ -4 j-i.ii_i i-r ^Lj 1-r pri iC # 3i 3 v,! 4^ I -d LTI "H. IIJ 

"E^a r-i~ 8- i-iiir-njif ^ur-a u-_l 

ElAttHi r-»lli » k4*— rH .-.li » y. A Uf ^Q— lb 

O. Bh'— 'iHL ■-HS-m'^ Uf^4HH ^^I^-u'^ Kii«- CC 

ht X ^ T -P liJ -b. ^ ^ En- 0. — _P w i^ B-d UJ H 3- bj 

l-|-»!rb- ^>' I- 3 U 3«3Lni£a ^^ '^ -w* U.rHWt^r-U n 

U.H. u-a^^UriiUI ^OD Zl-MlCU£^ 

i^r-f^p «-a^ Un-u<^^ ^r-IDC--i-LGU.iHi:n r^ip3^D-&i-uD- Z^-^ 

■wH-ir-F m ■'-' '4PH- ^ u» — imi — a. UJ —PI— -lu- 

J J I' ^4 rax i^wuzt -I)-- UZI-'Ui am 
Z a u--* u-upu-in «u.v aL^4jLiii— L1.I1IX ^^ 

— — * -H. w.^^ w f- w- 1- ■-. w-.^ i*3 — a 'T-jP^'^tj 1 

J-i.EC>-in£^ »-HKi-m*-r'w*jfcw ar -^ U .3- Q""" i^' I -^ Om i-m m-i ^O 

M- El «t E ^ HE Z x^ -n. 4 ^ ui %L "^ U_ X «« r-P EE 

V Vi — ■ 3 ? -^-^ sj- o -31 XJ 3 ^«*d 

4J^ U. -i mi-i ™ -■ .ri.-I r^ -HZ i '•-Z'lr " 

Zi-i^i_r 44- 4#Hi -W 4# VH/t 4A «4^ -iJt h ^ w« iM 



1< 

m 

6. 

Ci 
Z 

UI 

a 






■u 

H-. ■ 



HWl^ ^v w- -v 



a _l >4i a^ ^ * £C 

SC UJi^ I- ^t— ^ 

^ in %ri! m --r iBl m laj 

■I ■■ »T- r^ 

htf Oh 

'-■ b K 

in vl £ M 






*C^ ■-•■iH iH 

Si "^ ,e 

k V -d 

■^■^ w cs 

1^ i^ ^ '-li m iu 

1^ 1i OX O ■Jiff 

-ir^^u- ^-Minm t, 

Ul'-i 4^ 4i^ D^iiiiv 

5 £ ^ '^■HiiaE"-! 
* V -i I & ¥ 

■-' >-■ 4 I- Ccf -_iW 1» 












1: t 



3 i 



E 






fit i^Q 



a. 

V LU UJ 



o 



La ^ iil 

So o 



UI Lf Ii1< Lt 



~ a aaaci 



o 1^ 



ti-S 

I U^ n 
^ UI Q 



STC-EZ£izim 
Oamu-u-^aD-E^-;>gg] 



iBA^ki.kOS'P4-« 

o □• n lu X IT Li 

a.E E u 1-1^ 
¥^ UI' uJ V. X £. 



S 5 



i I 



s ^ 

« mi 



lii i-i — I 

U U3tf 






J- 5 

L^H 



IS 



' iJ ^ r>i E-- U. DC 



^C 



UI 

u 



"g 



-"3 " 



!S« 






UI r-I 

U 

!£ UJ 

UI ., — I- 



l=il— .— .— ■C.-^=UI 



►» *I ■ 

= I — 

«f k- 

]4S AC Ltl 
- Z" UA UI J 
Ai3 -I : 



D CJ'- 



^ < |v 

7 U^ UI UI E 
-S- 3- [i 



hP 



l» 


.fW 1^ 


liJ 


-a 


rA, 




^j. 


3 


in j-i 


E 


*^rt 






.bP 


Ufa. 


hrt 


UJ 






IL 


LdwJI 




■ 


s 








DiBi-i. 


e 












= X^^ 


hpi 


N 




-1 




2i£ 


^ 




m 


M 


■w 


■— L 




rr 






^ 


r*i 


X 


=»i3i3r 


iij 


r^ 




I-I 


or 


^ 


:£ Z 3 


:* 


Ilk 




=■-2 


T 


l-Q 3 


Z 






I 




H 


u 


= 


-z 




->_i 






Z 


:£ 


III 


:^ 


u 


u 


u 




n 


ce;-^ i£ 




m 




u 




^ 


u. 1- 






1- 


f— 


^ 



UJ -K 
^■4 



UJ UJ 
L4 U 



UJ ^ ->4 kJ 

tJ.-.-H --..T ^ 

U JZ 3 -Js-k- 5 a 
^ 3 Cr^ 'M uS — CH 
■^ a LG — -L-- 7 + 

^ ^ _. ^ a £■ ■»- 

m^ ^^ dL UI "Z W ■* 

H — ■'^ WF hr* Q> 

■^ «-• «■ ^ -* -3 M' -» -if 
I- £■ O J", O ** ♦ «■ 

UJ^UJ^^it^ lf«.^I 

■_i?ai/iLa-:A -n i— 

C t 

a. C -= r ■ '—I r-I Ld 



5^ 






^ 

-S 



4f <3 

p— <U 



Z d! 






l-P-wLP-^b 3l.> .j w h vc n 4* I-' 



£ L. 

3 i 



5-" 5^ 

•w< ^ -u- rn iH I 

— fc -Iff ^F ir*i WS a 

.-.iniiuj-. .-ri «_^ 

■H- -HE 4 '^- -H- ■¥¥ ^IH-i_r 

■w^i_i%j-i_pUJi_i ^,i«b.^ 



;^ S 



3 3 



4> 



105 



■PH. .H. m 


"ics'"i" 




* 


faJO 














.^U4 


p^ 


rfm ^ 


S S' 


■a 


^ff- ^* 


'-^br wo. 




4 




^^V 






^rf bi 








t— b— 


Oh Ul 


r* 


^ 


aiu m m 


^^n-sui 




r-P!- 




n ET -L? 




Oii^ 




a. ^ 




u ^ 


■3 0. 


25 


■APh' 




P 


w ■"■= 


zcaut 




U 




■-■ I^, 




M# 




a- 




u« 


Lk a 


E^Z 


Ut 




■w* En- 


ui^a 




IE u 




i-i l-OC 




^ 








s 


UJ 


oC 


%& 


U 






M n. 




IV 




L^ Ed 




■* ^ 








m 




-^ 


3 




-^ UJ 










UJ 






^ 


J-B 


-+ 


■_! 


4 








" 13 


■— ■ WF 




Ed 




i3 




^ 




J- 




-^ 












< 


vt + 




s 




4 






rt 




t^ 


•c 




.m 


tf 


^%. 




1- 4H 










f^ 




rt.lft 




r*i 


* 






V 


^ 


«* 


w- 




El_ 




^ 






*l 


r*i. 


rn 


m w» 


-^ 




__^ 




-iB 


f"^ 


IV ^^B 








■ 




n 




Ew-m 




^ 






"a 




1 


" 1-1 


3 -* 




o- 




V 






a. 




& 


H>.' 


UJ 


iH 


r 


,_, 


1- 




m 








■a. 





■-h>n_ 


a 


Hi 


s 










4.^ E 


3' X 




n—r 




U 




D 


u 


^ S 


111"- 


r*r«) 


U. 


rri 


cr 


OM nil 


lil Uf 








■ 




UJ 


3 


^ UJ 


1- 


a-^ 


esi 


1^ 








^ -■ 


_. -t 




^ 




■*t" ■«% 




S3 


a^ 






EE 


"■D 


4 


UJ 


4 


■a a 


-4 u 




^_r 




■— ■ 






o- 


CT^ 


^ 


4, 


■r 


■ 


« 


D -S 


p "^n 








[L a- 






m 


!-■ cs* 


m 


k Z 


U 


O >- 








■Jt .2C: 


■-r-i-n^ 








3^.. 




w 


¥ 


V 




^^ 






r^ 






&- 


., ^a. 




-»i 






■ 


->^ 


Ei^-^ 




1— 


"fl 


■"-ilO 


— JC 


s 


j-B 


■« r-h 






E;s--i 




acx "■ 








* 


^ 




■ 


i>l 


■H VP 


UJ 


« ~£T 


t-i-O 








UJ1? = 






*^ 






"O 




^ 3 


-J 


■c 


u 


— i^ W 


«> + 








£f HI U 




^ 


i_i 


■^ 




■1 




in E 


i£ ia 




^ 


HH Trf 


^ WJ V 




*^ 




C^ lil 








•-I. 






^ 


3 I-' 


m Ch 


^ 


O 


1- 


¥*«£ 








Z LiJ 














-u- 


^■5 


■u' a 


-1 




ID -Hl ■™ 




H^ 


1 UJ -Tri 














a 


J— 1 Ti^ 




P. 


f% 


Q 


X 


W"! ^ 


~CP M- 




H3 


i-i 






D 






.-^ W 




Ur«i 


p 






HhE 


Si 




4W 






j-^ 


^a 








h 




U ■ n 


sE» 


^ 


-i£ 


■**^ " 




■■ 












r* 




■9 


-^ 


4 -kr -u- 




r 


^"i ^ 


tf * 


D- 


z 


tAu- 


Z 


^T.rc 


T 


r-ii 




i„ 


J— I 


Hi 


M 


i^HJ 


■U. Htr 




wHnj- 


■^ ■ ^>- Uf Id ->ir-i 


ur 




UJ 


r-f 3 


«i^ 


-H 




8 


■Opt. 








■ef 


M 


D"'^ -1 


i^* — «-« 


-L 


.^^^ 


-1 


■^ 1- 


-1 


n< Ji. 


o 


bit 


■^^ 


-^ 




tf^ 


h 


_lr*i 


S 1 iQ 


a t- 


Q 


H-u^™ 


1^1- u 


a 


■I* 


^ 


"3 


•^ 


1- 


t 


^^ 


■ V 


Hr< 


CiM 


n-^M^O 


^I w 


■J 


i^^ 


a 


U. B 


u 


iVa 




•^ Lil 


^-ir*i 






B^ 


-4 UJ-rHEE 


UI^UJ 


EX 


m-riE 


T£ 


^UJ 


££ 


* ■£ 




UJ 


ii'» 


O k UJ 


^ 


m la _i 




-H. li^ 


. — ur 


V. ^ IV 


'■^T#i£ 


o,< 




D. 


ir^-i a 


c 




v^-^ ^ 












Z-i 


3 1- 


m 




iij^ 




^^Z 




5- 


^►3 


.r^ 


4 


^^ 




1^ a 


"^a 


■H HJ'tuK Nl±.4 


■4 


&• ■■ 


■^ 


2 


P 


<1C 


UJ -4 


Ujyi. 


-<" 


^ 


ui-H<S. 




s 


Z4 




^n^ ifc 


t 


b^ 


¥ 


V 


t 


E'* 


^LJ 


^-W 


f't 


" t4 


P 




■ ■ 


-•^ Hi--a->- 


^ 0^ «iHn 




■K, 




a i^--L. 




-j--i# 


J- = 


*c 


n 


J ^O 




-a,. 




•s 


1^1 1,^ 


•S"i" 


■uHh 














t- 


^ 


"li- 


t-^-f- 


' F KH 1 




hi^^^ 


-?' - 




4-4* 




hl^lilL 




T 


T^UJiMi — 


l4IiJ 


^irt Ul 


IZ. 


1-1 yj^ 


^ 






■"»- 4* 




** 




Z-.-P 




-i^j^CJ 


■" " ,rt 


VI- H^ 


n^'^ 




■. 














P- 








■1 


1 


ii^ 












r-k ■— L 


. ^ J-B 


=I-K 




**i 




■r^ 


r4 t 


5 


4 


* 


^ 


lt-rfcn--i 


w^ 


T-ir^ 


■ -H- -H- 


w- 




■■•P' 
'^*j 










2. 




a. 




■v^ 




^ 


■vr 


■W«% 



s 



Of » 



K -ip Ul 

Z teJ ■« O 

W45 * H r 4 








1^^ 
■^ 


s' 


■ 




,-»l 


r»-..-T. 

In. 


2 


■b>En 


s 


P o" Ul 


■^ WX 3 


-»i r*i. -H *^ 


ai 


San 


»-- 


«1 * - CT« 




E 


UJ kU kB 


uJ« 


Uii^ 


^ 


Z O 


_J-_r 


r>a,ci 


3 J -d; 


■g da a-s- 


PS i 


_l UJ 




^ 


3 


E E in 


X ur 


Jt- 


f 


u m-n- 


mo jc 


u X 


'c -• I E. " 


rt 






l# 


EL p_i r^ 


'^ ■% 


EL 


ft. 


^ El_ 


v>^ 


PT-- iJ 


K-UJ3 


iTI^SS 




a ^ 








C3 a 






E 


U- UJ 


a Hi 


UJ E£ Z 


UJ ^ UJ 




z,^ 




lU li^ 


CT 


3 


o 


UJ 


- CT <^ 


1^^=* 


i^i 


' «■_.. 


VI 


L .r-i. 


I' -4 




Ei^ 


Uf 






1- 


■q 


^WIU 


K.P4 




■^ 




i 




1- 






>i !" 


•^ 


^rt 




rd 




A 5° 




^i 


H" 


& 


e 


u. 






w 




s 


It 


1- a 


■H iW 




■tat- 

— u 


i 


£ 


m 


w 






1 


w 


^1 


Ul 


■H 


^l* ^ 




*" 


■*i 


a 


3 


U- 


r»i 


S 


.n 


g ? 


3 


UJ 

> 1- 






m 


S 




rj- 


^ 


■-1 


UA 

41 


S 


«U B-B 


Z CL 


4* 


_| qj 

o ca 


i5 




E 
1- 


IW 


N 


1— 


-I 




^ 


a 


■^9 


m M ^ 


" CF 


bn 1- 


E«* 




-^ 


X 


CI 






> 


E 


•^ 


HbJ 


K -Hi-r 


.rn Ul * 


UJ' 


■^O 




Ul -u. 


f 




s 




^rf 


^H 




''■^ 3- 




"*> hH 


£ ^ 


— u.i!i 


r*i 


a-dM 


-< 


E 






J-L 




W Z"*«- 


«■* 


irta- w 




<S 








u. 


Ct 


3 


^ 


■N 


— HH 


-M 


S £ 


^^iJ 


s 


eX 


^ 


At 


Ci- 


■_il 


IC 


^ 


T* 


■4-ba 


5^^ " 


flf 




< 1- 


-E- 


Ufl 






Cl- 




3 


r*-. 


Hr 




*I- 

^^^ 


i 


1- ei 




M 




t-1 


Ui 


■r 


rjE 






> a 




P 


IL 

o 


i 

3! 


-i 


-»» 


^ 


1 


1>J 
■(3 


aiu 


Hi 


+ ti 


h -f 


3 


D 






? 


■a 


Ifl 




Z 


d 


3 tjw 




■w O- 


^-^--, 


■H Z 




Ul 


E 


P 




u. 




rj := 


dl 


3 ■•-■ 


1r~ 


■M-4 








-i^n- 


d 


^rJ 


^ 


ad 


tf'^fl 5" 


1* 


HSB-, 


B# 




o 


a 


3 


U 








□ -> 


X 


■_crB 


-^ o 


H.UJ .m 


Of 


U^ ? 








CL 






H»- fe 


a — 


v^ca -^ 


Bi f 


a in 


Ufl 


i^;^ 






X 


o 


«_■ 


*J 


S I 


? ^ 




'1. ^jT-M-Vl 


L V 


* a^ 


o 


c? 






Id 




■— !b 


£^ 


JT 


i± 


*1~ u.^ 


■^H-ii 


Ef 


^E ■■ hC 




Etf E 






jj 


4 


w 


Q^ 


Ul. 


-~ ■» 


"-,SS- 


S -H. 


vHh^ p 


3 


Ul UJ 


J-h 


^^■"1 


m 


t 


^ 


_l 




i3 


~ 3" V 


N -u- 


a--hr-i^ 




c--^ 


•» 


W1-* 


at 


HH 




ihmk 


'^ 


— sc 


-^ 




^ 


*s -^ 


^ H 


s- 




53 


' 4* 




;»is 


m- U1--4 UJ 


X-r-i 


XII. SeS 


2 UJ J 


UI^ 


Ul IH 


£. 


IJ 


LQ 


a.--" 


u U UJ b_ 


4 ^ Df u- 


U_ ^ 


_1E-- 


}^ 


u N 




fe 


Q.IT _J 


az" 


1 UJ 


■"■"t^*-^ 


4 UJUi 


"g* 


UJxji- ^1 


«-o 


lE 1- rt 


-£ a 


iij i-^^^ji- 




-Z -I 


— UJ £ 


4 n. 


1-1 i ^^ 


'' = '"5 


1-^ — ' -H-XL 1— 




?3 


3" 


I . 




'^u 


UJ _L 


nc J _l 


IT -^ 


m iH 


UJ 


UJ E1_D'^-M> ^ U. D 


■1- 




3- lE 


w£ f^-k -^^ 


^WrH 


^ ^^ 


T»- 


a ^^ai^ ■-■ Q I? 


X 








d^-u- 




Et. 


-ur 


r^_J -V M-'^in «■ 








El. a z ■■ 


Sea 


M £ 




s^ 


^^3 


3; 

P A. 4 


..1 4 lu 
DiUJ — 

■P ^3- a « 


^UJi 

33 


3 1 -t- 1- 



10^ 



« z ■ 

Ua -^ I 

Z *■! 

id 44 



zo- 

- UJ In I 

E <-> I 

'1* 



Id 

4 



f'^ AM -^ rJ, 



Ik-O I 5 1- 

hz Ik. —-a 

« U If- 

i" SI 






— ^!^ -'^ H^ -K 



sc 






i * 

d -I 

o- 






'4 nQ'--' 



•^ '^ 



lU 

ml- 






n4 






kl ■>T' 

■ H 



a. I 
idc! . 



■=5 



IS 



H"^ ^ 



of H 



B ^ ■_! r^ — I , 



I- -K- O 1^ ^ -ij- J— . 



J—. * 

i^ z 






lU-ui 



4 -v% UJ 



4 

■•cr 



IZ Z ! 
a- u. 

:5"' 



iW V^lA l^ 1^ ^H O -^ 



C£ LllW 
■-I V B \ac 

u-ld = —tp 
-fm □ u a. 

Ill H 1^ O 

HH H^ ■ — I— v. 

mo .^ >-i. UJ 

— H • ™n> 

z^Q )e£ EV — D. 

k « 114111 at 

4 v^U ■ 



0. w 






49 
Z 



4f 



HliaP4U1h« 



3^ 



? 






3J 



o en --r ^ 



en; 

3 

■J 



I 1 Lrt- 

Z f«i ur 

— M lA 

^ >^ z 

u»™ 2- 

°. « Z 



a 






Z' 






* + >"**♦ 






p 



ii 



b, ^ H » 

«« m m— r 



-!# U+^FCS-i-- 



^^ 



P **< ^ E 



i :: 



I UJ £• UJ 3- LJ iH XJ LJi l_J 



* Z 
K Z 



1 Iv 



tp ^ 



Z ^ 

3E 



2 S 



rS3 "S 

EC 3^ Z ^ » -~ 

O •- a,t- -1 

p>- I— _j u.-_j.rtouj i^-^f 

K A a^ j^ tj j^n.a. Jul — «, rt ^. 

4a CT** li-^ m-B0-O x^'J « r-i 

"" ~ T~Ztt gM a ^ 

^ L^x-u v.-*^ U. UJ_IZ —I * UL UJ U_d 

u. m 4^ x^ ^ >du. aw H -^ut 

w ^ TJ.II1 a^j. F^ — 1- 

ClC k£ IC U J -^1^ '<K,z Z '-Z 

' u- -» KW < Vii™--! SlC la ^ 

Irt ■»-» W •-« 4 S r-t ■»»"j-i ■-< O 4 O -1 ^ 

-- £J 4 xx r^. 



X P 4 * ' 

UJ u- %>■ & 



^ 






I- r 



'5^ 



OC X i-rT S ■ bJ ^ h -H-_p 

5^ -- > g ^|t wi « ^p+ph n^ rr 

■TH Id ->n.O ■■ O O: -V r-i. -H- ■— «V a 

jx ^u. -uu. -^a p^ MO. ih ^s 1. 

i-r -ni U b_ a_ -s<- U. -u-^-i-uei^l T^ --fa u. u. -±1 

^ ujii-bx: 9 D -kO Ik ^^mjiL. -H-uD. z 

^tu-s- ■*' Q taH^JCQw ca 

"■ z:fcLj -uj^ ^^ iw- u. Trji-.^ij^^ ^K 

^ ^ -^-^ r^ ^--raniih h^ ill rt. 

X U M !Lb ■■ a-i ■-■ -M ^ >« "^ZS^i-i- LL 

^ n x^ j-ii ^^ ^ jt i_p »^ H U^ Tj. a m iB4 

U D TH •■ 4 iD V "^ EL =- M Lk pri U 

4^ V -^ 4 uii -a d ■«' o 3 _i ^ 



Sdi W "N & tJ B A E -HZ «k A p id ^ u. ^ 

_1^_ Lii^O -^O -^E 4-Ha wLiJIEA>Q U| 



If r-L-Til 4 -r^ I— 
■I ■»■ ^ GP 

x^ n. H" «J Uf 



X 111 i^ Id f^ 

A a. ■— I ■ bh 
Lii- -^ ^ ^■•-'hj- 
« V3 * ^ 

'^ ^ -ur ^^ "^ •□ b> 






V E % a -n, ^ 

3 D X : 



» 1L«- -Ki 

4> V Z. # iC Z Q2 VH 

__ _ -auj^M- kj-vAU-u. 

U-Lli U-UJ Xfcd M""-! ia-l^iH'>-L3iuj|^ -•n-^K -n. -^ 



St 






■H 4 4 4. 



107 



2) bttiiiiitiOR of the Function OPPOW 






H J -cr: i** .-i UJ L 
li — - a 0« wHp ft.-P- 



Hi ? 



^ 5^- 



a. a I u-r^<r hd u- d ^u. 



Ui «■ -n- 43 u- ^-i,.!^ u. a. u- 

— n- I— — lU ^ Hh^HH Ml I? ^ 

I- 1-^ « « -■ 1* ""J -■ Z 



■J 3k UJ w* 10 ^ 

X X 1- « Ik E S 

^5 a u — o '°-^«- 

tU- r-1. b :C C> -^ U. ^ -u- 



9— j-i fa- n «a K U. 1^ V> Ik- H Z z o- ^ 



2- -n X ■>■ ^ O .n. L^ X -u- .rm. UJ IxJ 4J> O ^3 i— .j-i U. U. — E;]L1J U. -"^ -I 1^ f^ E 

Ox^^Uli^—lU. i^^d <^ -MU^A ^I|J< Lj-n-i,J 1-1 O^j-i lAt^i-J-ri.^ i-r r-^O 

— ^ — F**t M5>Wk4N -3 ^-^ ^s ^i-. ^"_"''5^_. ^S .""5 

M<l— O-a^^ Ur XZi>' lr>-UJ<»xJUf bJUln.a|^K_ 3^iLUQu-l— Uja.Lt_— ^-l^r-U 

Z^OOt^^ _riiE3>T_i i3<«.U i« I- B ^K Pn-fZ biT^i^eif >-''•-■ « d 

ui i_ = B ir -.J iL pril bJ" I "H IU_W E^,ji^«uiu.iiu n -^Ul 

■ T Q- w! i-f "X -v U. X" trtr-^UM i^ r^ C» K E *£ ^ ^ I^ HH Z^X lA 

upS iA h □**■ i> ixjiij ujs Hva ■ — — a w r m Q'^-a 

■Ha w0 3 '.- w'.j □ Ai> «. a.i-j'-iz i-Jif zzv i^-^--r^u. m^cfiX 

^ D. ■»-. -^u. — -aS— -irS-mf^ ZvrWri — ^ S O S ^ VI 

^Kis Jitii ri^ * * t 

AiuLJJ'^ n^^rH ^n--irHA ^h i-i ^ Ar^ i~tr^ BflUJ ■ 'ruj-^ ,m, ^ 



J £ L4 

5 I- irt t 



Ad o: Iri Z I 

LU LU UJ ^ ■ 

6. fe ■■ A. -■ z : 



** a IS UJ ^ ™. 

.^l-p^^A.* ^Ek.* B.LJ1* D.K. ^ 



a 

u 

EC 



^ r-U^ ^UJUJ bUJUigU i^puJUAUJUJ kUJ -Ol U>.Q LU UJUUf 



. _i a. 
Ik ^ m e^ -^i- 

bJ -u Id 



I S - 



lid Z 
■Jtu- 






h rl !*> 



Hh i-fc.-^ Pr -«■ 

■H ^^ ■ 

1^ 'bd "H- i-L 

U. IE ■■ -r^ 

4H» m -a r», M- 

, >,A X,. UJ 

■^ . * "T ** 



uX ^ n v¥ 

5 3 rt r"--* >^ " ^^ J. 2? 

OK-u'-'E '^ U _ ^ '^'^ ^ •*• ^ 

Z^SE-H^iU^rJHr^ Pl"Z 1-r ^-1 *" 

V »x,E-,-. LJ ri #» - - ■ "■ 



& ^ K u. 

ao iL It 
It - ' 



V* V UJ 
hH — ' 

1 ft T 

5 t X 

o a 5 

"*■ Z"t: 

X i-r.«,0 



r- IE 1- t-^ ^1' Z "- ■" PrHli * ff " "J "'^'"'^ 

j,4»» W 3 1- P mf^^ ■"■ « -rf^^P N 1^ X If —1 "^ 






7^~Z.^^,n^Cm^ ^ ^-1 lA j-M 1^ -"■ w ii^%j-m* -H-iA .■>i-4-J«#d£>Aadl->^X^^ 

-^r M.-iiaatwiA I^K,«aL5i.' -.Jlfl-"i-J-U-HO"Ilf 

^t|Ji „_ n Qw u X i-r ~ — -^ k^ --- 

'^-' E""; '** Is :i i « - ' 



■n- U. U_ 1^ --r -IviUS^-^Xli^ 



■9 '■ 






of 



a Si i5 i-Eii.it-» 



loa 



AFPEMDIX C: GIjOML INPORH^TIOM IH STUDENT 



REHEMBER(( 

{ PEOPLE J 5 THE PLURAl Of PfRsfl*!) 

(FEET IS T11E FLUAAL OF FOQTJ 

CTAHDS 15 TMi PlUflftL OF TAHD) 

(FATHOMS IS THE PLURAl DF FftT>lO*l> 

UwChES is the n,URAl OF INtH) 

(Sn-awS IS THE PLURftL OF SPAW) 

(ONE HALf ALWPTS MEANS fl . 5 Ji 

UNRE£ NUMBERS ALWAYS MEANS tH£ MfiiT HUM^EA AH& THE ilCOHB 

NUMBER AND THE Til LAD MUMPER) 

(ftRST TM NUMBERS ALk*¥S MEAKS 

Till; FIRST NUhBEA and the SECOhD !iUHSES> 

(MORE IHftN ALWAYS HEAN? PLUS} 

UHESE t-L'Hf-fi MEAJJS THE} 

(THO NUKEEna S3mET<ME5 HEAN5 DNE NyKB£R AfUl TH£ 

OTHCft NUMB£fi> 
(Tito NUKBER& SOMETIMES MIAhS ONE CF THE 

hUMB&AS ANt tm OTtiEH NUMfiEA) 
(MAS 15 A VEROJ 
(&ETS IS A VERDJ 
(HAVE IS A VERB3 

CLEAS THAH always r^EANS LESSTHAWl 
(LESS THAN 15 AN OPERATOR OF LEVEL H 
(f^ERCENT IS AN QPERATOR DP LEVEL 2} 
(PERCEH-r LESS tHf,iri wM'^t MEANS PEAlESSl- 
(PEr!LESS IS AN OPEPfiTOR DF LEVEL 2J 
(PLUS IS AN OPERATOR QF LEVEL 2) 
(SUM I S AN OPERATOR} 
(TIMES IS AU OPERATOR OF LEVEL 1} 
(SUUASE 13 i*W OPERATOH Of LCvCL 1) 
(DIVST IS AN DPERATQn OF lEVEV I) 
(OF IS AN OPERATOR]! 
(CtFFeRENtE IS AN OPEAfiTOR] 
(SQUARED IS AN UPEAATOR) 
(Mtljgs 1S AN OPERATOR OF LEVEL 7} 
[PEl? IS AN OPERATOR] 
t SQUARES IS AN CPE3,AT0R) 
[VCAAS OLDER THAN ALUAV& MEANS PLUS} 
ETEAAS YOUHCER THifts ALHftYS MEANS LESS THAN! 
(IS EEIUAL TO kViikit MEANS I ±J 
£PLUSS IS AN OPEAATDR} 
(MINVSS IB AN OPERATOR) 
tmi4 OLD ALbfi--,TS MEANS HiHAT] 

(ThIE PC Kl METER OF $1 RECTANGLE SOKETEMEg MEANS 
TIJJCE THE %m OF THE LEH(;th ANO KtOTH Of THE AECTAMaLEl 
(GALLC^NS IS TNI PLURAL OF GALL(}N1 
dlOURS I& THE PLURAL DF HOUA) 
(J4AAY JS A PERSOJJ} 
imn IS A J'ERS&N) 
tRlLl. IS A rfRSD:^} 
(A FATHER L& A PEAgON} 
(AN UNCLE IS A F'EAS'Orr) 
(FOUNDS IS THE FLUHAL OF POUND) 
(UEICliS 15 A VERB} 
]} 

A£iiEM[lEA (( 

(&ISTAMCE EQUALS SPEEO TIMES T)NE} 
(D (STANCE EQUALS CaS CONSUhpt tON TIMES 
iJUMDEH OF CALLOHS of CAS U3EC}3 
(1 FOOT EQUALS 11 INCHES J 
il YARD EQUALS 3 F£ET) 
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AFFEKDIX E: a small SEaJAKTIC CEMEBflTIVB TtIJAMMAFL 

The gr^Uranar out Lined here will g«neratt only word prcsblems 
soLvabLe by STUDENr> though not the «st of all Bach probleuiar 



EIJI^S 

Create s fiet of aimulta'cie- 
ouiB equatioDS 'which can be 
solved by strictly linear tech- 

niques, except th*t *ul«|ll:ii- 

tion of nuittETical values iti 
higher order flquaet^ms ujilcln 
If gd Lire them Co l.iiie:9ii^ e^ua- 
cloeiB is a 1 loved » Unese are 
the progHiaitiiiTis o£ tbfi speak^ 
er'a [itci<dsl# 

ChDDse ankfirO^rte for uliich 
STUDEHT 1b to St>lvet Thia la 
the queationt 

Choose uniqi^e name* tor 
variables without artlciLea '*a" , 
"m"t or "the''* In the ftdb- 
Lew any ot ttiesa iirtlcLes may 
b^ v^^d at any ocGurten£«: £>£ 
A iiaTn«f In a complete mod^^l 
the34 names tJouLd be m<3ii^ 
^t^4 ulth the objects in. th« 
ch^d^ti propoeitioiLEi 

Wflt'S one kernel een- 
t^ftce for each eqiiStioti* Use 
any appropriate lingwiaElc form 
given In the t^blc fellow to 



EXAHFL&S 
2x + 3y - 7 

y - 1/2k 

2 
y + 2 - >4 



K ■ fir^t ciLiinber 

y ■" BcCc[¥i nuKiber Tom chose 

z - third number 



'"2 times the ft est niimber 
plus three ttin*9 the eecond 
number Totti chose ia ?■ Hie 
aeeonii rtwmber Ton choee 
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Whyt ia _ 




Fiiu^ 


and 


What are 





rcfit-eianc Eha arittiaetU 
funccione In the equa-^ 

For 9*ch unknovn vhiji^e 
vaLii^ 1$ to bfl found, nee 
a kei:n«l scntenjce of the 

foroi: 

Fin4 

'"^ 

atld 
for more tlinii one such un- 

£f a name a^ppeat:^ Tnorc! 
than onCe In B probleoi^ ^n^me 
Cor #11) occur rencea aft*£ 
th« first may be replatsd 
by n "similar" aawe, Slnl- 
l^r names are obtalneiJ by 
Ct^nsfoTHatlona vlilch; 

a) insert a pronoun 

for a nouti phrgse 

In the named 
b> delete Uitial and/ 

Di: terailnal sub- 

Btringa of the niimie. 
Only one inch "slBilar'f aCring 
CRH be ujted to Teplace an 4&- 
currtili^^e of a name^ though 
any tininber of replacements 
can l>c ludQ. 



KqiL^ls ,5 of the first 

flunil?cT + 

The Btm of the aeeond. nunt- 
bcT Tom chose and a ttitrd 
number is equal to tht 
aqmare of the firet mnn- 
ber. What la the third 
niimherl * 



Sljidllae inMiveoi 

"first" for "flifit number"' 
^^eecond ntmber h* ehose''^ 
for "secD-nd ntnnbcr Tom 
chocs'" 
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If 'S\ AC cur B in 3i snd 
S ^^^■. t stid. In S it ia the 
ttltirt substring ta the left 
of ^'■lg"j "&<]uals" or "ia 
*^u?il t*" (or the entire 
BubstriTig to the rigHt) then 
in S ^* H, Oisy te replaced 
lny irty phrase containing the 

Jiny phrapft V may be 
replaced ty artOChtr phraae 
Pj -which HiBdnS the same 
thing* This would mean that 
SmOEffT had been tuld oi this 
eqiiivB 1 enc^e uain^ REMEMBEiB. 
BTid the B entente "Fj always 
meana P^ " or "P^ acme times 
meana P. " « 

Two ceirt3«Cuttve aen- 
tcDices may b« cisrmected by 
rep Lacing tha period after 
the flTBC by '\ ^tlJ'V A 
Jtetttance can be ccmtie&Ced 
to a qLiBBtion by preceding 
the EenCance by -"If" atid 
replacing the per ted at 
the end at the £ent«nc« hy 



pr pr 



Replace "Che second number 
Tom chcsc" l>y "thla aecond 
choice" iti the third ften-* 
teace ■ 



RepljiCe ''2 tlmea" by "twto*" 
and ".5" by "one half". 



CotRieet i^enteiices 1 and 2, and 

H entente 3- ^iWl thfr final queS- 

tlon to £,iire : 

"Twicie the fir it number plus 
three tines the fle£(?nd 
DMmbET Tom thoee is 7> and 
the second numbet be choHs 
is one half a£ th« fir^t. 
If the «um d£ thi« jcOdml 
-chc^ice #nid: # thit^ nunbcr 
is equal to the S^u^re of 
th* iiv9t nguber, wivat ia 
the third nuiaber?" 
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Sunmi^iry of Lin e'J i ^ 1= ^-g^ Forme tn Sitprt§§ Arithmetic Functions 

and the fi^juallty Relation 



X = y X Is ¥i Tc eqv^alB y; k it ftqua 1 to y 

X + y X plus yj the «UM of K and y; s more than y 

Ji - y I minus yj the diffeeetice l^frtWeen x and yj 

y leas than x 

K * y X tljnea y; x Hii^Ltlpli*d by yi x of y <lf x 

is t'l number) 

K / y X -divided by yj x per y 
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